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CHAP TER I 


DEFINITIONS AND FUNDAMENTAL OPERATIONS 
OF MATRICES 

1 . Introductory 

The notation of ordinary algebra is a convenient system of 
shorthand, a compact and well-adapted code for expressing 
the logical relations of numbers. The notation of matrices 
is merely a later development of this shorthand, by which 
certain operations and results of the earlier system can be 
expressed at still shorter hand. The rules of operation are 
so few and so simple, so like those of ordinary algebra, the 
notation of matrices is so concise yet so flexible, that it 
has seemed profitable to begin this book with a brief 
account of matrices and matrix algebra, and to derive the 
theory of determinants by the aid of matrix notation, in 
an order suggested by a naturally alternating development 
of both subjects. This first chapter is devoted to explain- 
ing the code. The reader is invited to take it with due 
deliberation, to invent at all times and verify examples for 
himself, especially in regard to the transposition of matrix 
products and to the multiplication of partitioned matrices, 
and, at first, to write out results both in ordinary and 
in matrix notation for comparison. The confidence and 
facility acquired by such practice will prove to be of 
constant service during the study of the later chapters. 

2. Linear Equations and Transformations 

The theory of matrices and determinants originates in 
the necessity of solving simultaneous linear equations and 
of dealing in a compact notation with linear transformations 
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from one set of variables to a second set. In the very 
early stages of elementary algebra we meet simple equations 
of the first degree, of the form 



ax = h. 

* 

(i) 

At later stages 

we meet simultaneous equations in 

two 

unknowns, 

a^x+hy — 

a 2 x+b 2 y = h 2 , 

• * 

(2) 

in three unknowns, 




a^+btf+CjZ = 
a 2 x+b 2 y+c z z == h 2 , 
a^x+b^+CsZ = h 3 , 

m • 

(3) 


and so on. The method of solving by successive elimina- 
tions, and conditions under which unique solutions exist, 
may perhaps be known to the reader. 

At a still later stage, in co-ordinate geometry of two 
dimensions, we encounter various linear transformations, 
such as x — x' cos 9— y r sin 0, 

y = x' sin Q+y' cos 9 , ' 


(4) 


representing a change of rectangular axes by rotation 
about the origin through an angle 9 , and’ in the three- 
dimensional analogue of this we meet with 

X = lyX +4^ 9 

y ^m&'+mty'+rnQz', . . ( 5 ) 

2 = n^x' +n 2 y' +n 3 z 

where the l i} m jy n k are direction cosines. Indeed every- 
where in mathematics we are confronted with equations of 
linear transformation ; and this in itself is enough to justify 
the search for a code and a calculus. 

The general set of m simultaneous equations in n 
unknowns is 

+ a 12*^2 +•••+%»#« = h l9 

^21^1+^22^2 + *--+&2n a; « = &2> 


a ml x l J r a m2 X 2 
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and the general linear transformation expressing m vari- 
ables y v y 23 y m as linear functions of n variables x v x 2 , 

x n is the same as this in form, but with the variables y 
replacing the constants h on the right of the equations (6). 

The number m of variables y has been taken as possibly 
different from the number n of variables x. Such trans- 
formations of unequal numbers of variables can easily 
arise in practice, for example in questions involving 
perspective drawing, where a three-dimensional object 
may have to be represented on, a two-dimensional sheet of 
paper. Assigned points in the object have three co- 
ordinates, the representative points on the paper have 
two coordinates only, and the representation is given 
algebraically by a set of equations in which m = 2, n = 3. 
• 

3. The Notation of Matrices 

It would be intolerably tedious if, whenever we had 
occasion to manipulate sets of equations or to refer to 
properties of the coefficients, we had to write either the 
equations or the scheme of coefficients in full. The need 
for an abbreviated notation was early felt, and in the last 
century Cayley and other algebraists of the time made use 
of contracted notations such as 



Ct>i2 . « . . 


a zi 

^22 * * * * 

. a 2n 


a m2 . . . 

®mn 


*i 

x 2 


Vi 

V2 






Vm 


(1) 


for a set of linear equations, detaching the rectangular 
scheme of coefficients a i5 from the variables x i to which 
they referred. Later Cayley, by regarding such a scheme 
of ordered coefficients as an operator acting upon the 
variables aq, x 2} ..., x n in much the same way as a acts 
upon x to produce ax, and by investigating the rules of 
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such operations, formulated the algebra of matrices, 
meaning by matrices those schemes of detached co- 
efficients considered as operators. The theory was at 
first confined to square matrices, but the inclusion of 
general rectangular matrices increases greatly the scope 
and convenience of application. 

Definition. A scheme of detached coefficients a ih 
set out in m rows and n columns as on the left of (1), will 
be called a matrix of order mbj w, ormx n. The numbers 
a €j are called elements (by some writers constituents , by 
others coordinates) of the matrix, a {j being the element 
in the i th row and j th column. The row-suffix i ranges over 
the values 1, 2, m, the column-suffix j over the values 
1, 2, n. The matrix as a whole will be denoted by A 
or by Iflif ] ; or on occasion will be written out in full array. 
The element will often be called the (i, j) th element of A. 

When once the rules are found by which matrices A 
and B can be added, subtracted, multiplied and divided, a 
proper sense being given to these operations through study 
of the laws obeyed by linear transformations, the materials 
requisite for an algebra will be available. This algebra, the 
algebra of matrices, has a very close resemblance to the 
algebra of ordinary numbers ; but it is a more general 
algebra, and the reader must be on guard, at first, against 
carrying over into it some of the more facile habits acquired 
in ordinary algebra. 

4. Matrices, Row Vectors, Column Vectors, 
Scalars 

A matrix may possibly consist of a single row, or of a 
single column, of elements. For example in 3 (1) we see 
on the left a column of elements x v x 2 , ..., x ni in fact 
a matrix of order nx 1, and on the right a column of 
elements y v y 2 > V™ a matrix of order mx 1. Matrices 
of single row or single column type are of very common 
occurrence, and the general matrix itself may be viewed 
(11, Ex. 2) as an array of juxtaposed rows, or of juxtaposed 
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columns. It is therefore convenient to distinguish row 
and column matrices by a special name and notation. We 
shall call them vectors, or more precisely row vectors nnd 
column vectors , and we shall denote them by small italic 
letters, the order, such as mx 1 or nx 1, being always 
understood from the context. For example, the two 
column vectors in 3 (1) will be written as x and y ; and 
a device will be given later (8) for distinguishing a row 
vector from the column vector having the same elements. 
On occasion vectors may be written in full in row or 
column form with square brackets ; but often, to 
economize in vertical space on the page, it will be con- 
venient to write the elements of a column vector in hori- 
zontal alignment and to indicate, by the use of curled 
instead of square brackets, that a vertical alignment is 
intended. For example, we shall write column vectors as 

x = (*i *2 — x n), y = fe/i Vi ... y m }, ■ • (i) 

and row vectors as 

u = [u x u 2 ... v = [v x v 2 ... vj. . . (2) 

In every situation in which vectors are being used it is 
essential to keej) in mind, to visualize as it were, what 
kind of vector, whether row or column, is in question, and 
on no account to confuse the two kinds. 

The matrix of order lxl, that is, of one row and one 
column, is a single element. It will be found, as might 
have been anticipated, that the laws of operation of such 
matrices do not differ from those of ordinary numbers 
used as multipliers. We shall therefore identify such 
matrices with ordinary numbers or scalars. 

To sum up, the matrix A === [a^] is the scheme of 
detached coefficients in some actual or possible linear 
transformation. It is not inert, but is to be imagined as 
an operator. It is also to be regarded as a complete exitity, 
like a position in chess. If, for example, we interchange 
any of its rows, or its columns, what we obtain by so doing 
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is in general a different matrix. Two matrices A and B are 
considered to be equal only when they are of the same 
order mxn, and when all corresponding elements agree, 
that is to say when a i} ~ b {j for all i, j. 

We proceed to develop the algebra of matrices ; but 
we shall find, when we come to consider what can be 
meant by the division of one matrix by another, that we 
are forced to turn aside and to study, define and evaluate 
a related set of numbers, namely the determinants corre- 
sponding to square matrices. 


5. The Operations of Matrix Algebra 

Addition. Consider for illustration the 
formations 


and 


Vi = a n xi+a n x tz +a iz x^ 

Vi = ^21*^1 “1" a 22 X 2 ~S~ a 23*^3 > 


Z 1 — bll x l~\~bi2p2~^blZ X 3’ 
Z 2 — &2l a 'l"ff^22 3: 2"l"^23*^3> 


trans- 
* (1) 
• (2) 


and suppose that new variables w x and w 2 are introduced 
by adding the corresponding y i and z u thus : 

= Vx+h> w 2 = y 2 +z 2 . . . (3) 

Then we have at once 

W l = Kl+^n)^l + ( a l 2 +^12)^2 + ( a l3+^13) a *3» tA\ 

= ( a 21 4 *^ 21 )^! “t“(®22 “(■^22)*^2 ”K^ 23 “b ^23^3* 

The process of obtaining (4) from (1) and (2) may 
logically be regarded as the addition of linear trans- 
formations, and may evidently be extended to the ease of 
two sets of m equations, having the same n variables x { 
on the right. The rule of Matrix Addition is thus suggested : 

Addition of Matrices. To add together two matrices 
A and B of the same order mxn, we add their corresponding 
elements, and take the sums as the corresponding dements of 
the sum matrix, which is denoted by A -j -B. In symbols, 

A+B = [%]+[&**] = [fffi+M- . . (5) 
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The rule can now be extended step by step (or could 
have been postulated at once) to give the sum, in the 
sense just described, of any finite number of matrices of 
the same order mxn, thus : 

... ... + 7 %]. . - ( 6 ) 

For example, 


' 3-4 2" 

r 

2 6 — r 


'5 

2 1' 

— 1 0 5 

+ i 

4 -3 2_ 

— 

.3 

-3 7 


Scalar Multiplication. Taking again equations (1), 
let us suppose that the scale of measure of y 1 and y 2 is 
altered in the ratio 1 : A, by the introduction of new 
variables z x = A y v z 2 = Xy 2 . It follows that 

Z 1 — /ry\ 

Z 2 = \a 2X X x -]~ A<2o 2 ^2~H'^'®23‘^3* 

This operation of multiplying variables by a constant 
scale-factor may properly be called Scalar Multiplication, 
and the rule for it is evidently this : 

To multiply a matrix A by a scalar number A, we 
multiply all elements <% by A. In symbols, 

XA = A[<%] = [A <%]. . . (8) 

Linear Combination of Matrices. Now combining 
the rule of addition with that of scalar multiplication we 
have the rule for linear combination, with scalar coefficients, 
of any finite number of matrices A, B, ..., K of the same 
order mxn: 

aA~{-l3B-\- ... ••• - (9) 

where a, /?, k are scalar numbers. 

Null or Zero Matrix. At tins stage we can introduce 
the null or zero matrix, defined by the particular linear 
combination A— A and denoted by 0. Whether rect- 
angular or square, vector or scalar, it is seen to have all 
its elements zero. If it is of the same order as A, we have 
A +0 = A . 
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6. Matrix Multiplication. Pre- and 
Fostmultiplication 

The simplest homogeneous linear transformation is of 
the form y = ax, an ordinary multiplication, and when in 
elementary algebra we have two of these such as y = ax, 
z = by to perform in succession, the result may be written 
as the single transformation z = bax, the coefficients b 
and a of the separate transformations being combined in 
product to give the coefficient ba of the resultant trans- 
formation. When we perform successive linear trans- 
formations like this not on one but on many variables 
at a time we shall indulge in a natural extension of 
language and we shall say that the transformations are 
multiplied together in a certain order ; we shall further 
call the matrix of the single resultant transformation the 
product matrix BA, by analogy with ba. To discover the 
appropriate rule for the elements of BA, let us consider 
the two transformations, ^ = 

Vi = + 1 +++ 2 +++ 3 +. *2 = 


— ^ 21 ^ 1 +^ 22 ^ 2 +^ 23^3 ’ z 3 


+l?/l + +2?/2> 
^ 41 ^ 1 +^ 42 ^ 2 * 


(1) 


These, as may be seen at once by substituting for the 
Pi in terms of the x k , yield the single resultant trans- 
formation 

\*i 

+ = (+i+i + +2+l)+ + (+1+2 + ^2^22)% + (+ 1+3 ++2++ r 3> 
%2 ==: (+1+1 + +2+1 )+. + (+1+2 ++2^ 22) ^2 ‘ (+1^ 13 ? +2*++ffi» 
= (+1+1 + +2+1)+ + (+1+2 + +2+2)+ T (+i r * 13 + +■/* 23 )•* 31 ( 
Z A — (+l+l + +2+l)+ + (+l+2++2+2) ;i? 2 + (+1+3 + +•/* ashp- 
it is natural, therefore, to regard the matrix of the 
transformation (2), namely, 

+1+1++2+1 +1+2 ++2+2 +1+3 ++2+3 

+1+1++2+1 +1+2 ++2+2 +1+3 + +2+3 

+1+1++2+1 +1+2 + +2+2 +1+3 + +2+3 

+ 1+1 + &42 +1 + 1+2 + +: 2 a 22 ^ 41+3 U ^ 42^23 


( 3 ) 
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„ 

as the product ; in the sense just de&dri 

^11 ^12 
&21 ^22 
&31 &32 
&AI ^42 

taken in the order BA . The principle dictating the choice 
of order is this : a transformation is an operation or 
operator , acting on certain variables : the symbol of 
operation will always immediately precede the variables 
it affects, the operand. Thus when the variables x are 
transformed by A , the resulting variables are naturally 
symbolized by Ax ; and when these new variables are 
in turn transformed by B, it is natural to write B{Ax) 
for the outcome, and therefore natural to write BA for 
the matrix resulting from the two transformations in 
order. 

,v 

^ By inspection we infer the general rule of 

Matrix Multiplication. The element in the i th row 
and the j th column of the product matrix BA is obtained 
by multiplying the elements in the i th row of B into the 
corresponding elements in the j th column of A, and summing 
the products so obtained. In symbols, if B is of order 
mxn and A is of order nxp then BA = 0 , where 0 is 
of order mxp, and 

n 

Ctf = „ . * (5) 

*=i 

It is important to note that multiplication is possible 
only if the number of columns in B is the same as the 
number of rows in A. 

The reader should pause here to satisfy himself that 
he has grasped this fundamental rule. He should write out, 
in their literal fullness, a few products BA and AB for 
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small values of m, n and p, and should construct and worK 
out for himself some numerical examples also, such as 


1 

-2 3 


r j 3 i 


9 

15 

4 

2 5 


i 

i 

to l- 

c 

1 


4 

8 


It is to be observed that the typical element c {j of BA 
is itself a matrix product ; in fact (5) is the same as 


where the first matrix on the right is a row vector, namely, 
the i tn row of B , let us say b[ i9 and the second matrix is a 
column vector, the j th column of A , let us say a Ki , Thus 

0 

Examples 


(These will be not so much exorcises as interspersed 
illustrations. For reference they will be numbered con- 
secutively within each section, and will be distinguished 
by smaller print, without further title.) 

1. The i th row of BA is the vector-matrix product b'^A. 

2. The j th column of BA is Ba u . 


We now come upon an important distinction between 
matrix algebra and ordinary scalar algebra ; namely, 
matrix multiplication is non-commutative . Consider for 
example 

AB *=* p 11 ai2 l p 13t ^ 2 1 L r a n^n”h«jL2^2i a n^32*i“«n^22l /m 

VHl tt 22J L^21 &22JV L a 2X^n “1*^22^21 

J5A == P 11 ^ 12 1 P U ai2 1 -a p > ll a n'P^I2 a 2X ^ll a 12d ,g* 

L^21 ^22-1 1^21 a 22J '^***81 ^21^12 T 22 J 


Scrutiny discloses, in this as in the general case, that every 
element in BA differs in form from the corresponding 
element in AB, Thus in general BA ^ AB. Indeed if A 
and B are not square but rectangular, then the only ease 
in which BA and AB can coexist is when A is of order 
mxn and B is of order nxm; and in such a ease AB and 
BA are necessarily different, since the former is of order 
mxm while the latter is of order nxn. 
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II 


We must therefore always distinguish between ^pre- 
multiplication and postmultiplication of matrices ; in BA 
for example A is premultiplied by B, but in AB it is post- 
multiplied by B. 

3. Since multiplication is non -commutative, the ex- 
panded form of defined as (A-\-B){A +23), is 

A 2j r AB-\-BA J r B 2 , where A 2 is defined as A{A ). 

4. Expand {A+B)(A— B). Note that the expansion is of 
four, not of two terms. 


7. Product of Three or More Matrices 

Though not commutative, multiplication, when ex- 
tended step by step to three or more factor matrices, is 
associative. For let 0 , B, A be matrices of order mxn , 
nxp, pxq respectively, and let us form C(BA), where 
the brackets indicate that BA is first formed and is then 
premultiplied by 0 . We shall show that G(BA) is identical 
with (CB)A. For since the (h> j) ih element in BA is 

v 

% bhk a M> • * • * ( 1 ) 

1 

the (£, j) in element in G(BA) is, by the law of multiplication, 

n v 

Sc ih Bb hk a kj . . * ( 2 ) 

A » 1 k=l 

But the double summation indicated here can be equally 
well carried out in the order indicated by 

n ii 

ECijfihk ^ a 7cj> • * ( 3 ) 

h - 1 Jc = 1 

which yields, though by a different order of operations, 
exactly the same final aggregate of np terms. Indeed it 
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is customary to denote both of these equivalent double 
summations indiscriminately by 

n 

2 2c ih b hk a kj . . . . ( 4 ) 

h~l 

The matrices C(BA) and (CB)A are thus element for 
element identical, and may therefore each bo denoted 
without ambiguity by CBA . The reasoning may be 
extended step by step to the case of the product of any 
number of matrices. We conclude that matrix multi- 
plication is associative. 

1. Deduce from 6, Ex. 1, that the element of U A X 

is U(iAz ( j f where u u is the i th row of U y and x u is the j th 
column of x . Deduce also that the i th row of VAX is u h AX 
and that the j th column is UAx u . 

2. In virtue of the associative nature of matrix multiplica- 
tion the powers A 2 , A 3 , A*, . . . of a matrix A (necessarily 
square) are defined without ambiguity, and identities such as 
A 2 (A) = A (A 2 ) = A z and the like are valid. 

3. Write down a few matrices of orders 2x2 and 3x3 
with numerical elements. Find first their squares, then their 
cubes, checking the latter by using tlie two processes A (A*) 
and A 2 {A). 


The Unit Matrix. The transformation illustrated 
here by the case n = 3, 

Vi = 

Vi — *2, ■ • * • (“>) 

Vz = z 3 > 


leaves the variables unchanged except in name. It is 
called the identical transformation, and the matrix corre- 
sponding to it, 


1 



( 0 ) 


is called the unit matrix. (Here and elsewhere we shall 
use full stops to denote zero elements.) In the general 
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case the unit matrix is a square matrix, of order nxn, 
with unit elements in the principal diagonal (that is, the 
diagonal running from top left corner to the bottom right 
corner) and zero elements everywhere else ; or, more 
briefly, such that a i{ = 1, a tj == 0, i =^j. It is denoted 
by I . 

The reader is asked to verify for himself that I 2 = I, 
whence P = I, and so on ; also that I A = AI = A for 
any matrix A , though it calls for remark that if A be 
rectangular, of order mxn , then the premultiplying I in 
the above relation is of order mxm, while the postmultiply- 
ing I is of order nxn. With this proviso we can introduce 
or suppress an I at pleasure anywhere among the factors 
of a product matrix, as convenience may dictate. 

Scalar Matrix. A matrix of the form A I, as for 
example in the case n = 3 



is called a scalar matrix. Scalar multiplication, as defined 
in 5, is equivalent to matrix multiplication (pre- or post-) 
by a scalar matrix. In symbols, 

XA = XI A = A\I = AX. . . (8) 

Diagonal or Quasi-Scalar Matrix. A square matrix 
with its non-diagonal elements zero is called a diagonal, 
by some writers a quasi-scalar, matrix. It is thus defined 
bv a** = 0. i =£ j 


— 

— * 


— 


wmm 

Al • 

• 


a il 

• 

• 

A 2 

^3 

9 


a 22 

a 33 

a 4A 


are diagonal matrices. 

5. The unit matrix I and the zero matrix 0 of order 
nxn are diagonal matrices. 
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6. If A is a diagonal mat rix of order m X m and B is any 
matrix of order m x n, prove that A B is obtained from B by 
multiplying the rows of B respectively by a llf a 22 , ...» 
a nn . In particular if x is a column vector, 

A.% = ^22^2 

7. If A is a diagonal matrix of order n X n, prove that BA 
is obtained from B by multiplying the columns of B respectively 
by a ll9 a n , . . ,, a nn . In particular, if u is a row vector. 


8. Hence if A is a diagonal matrix, AB is not in general 
equal to BA. In order to have AB = BA in general, B must 
be square and A must bo not merely diagonal but scalar. 

9. Diagonal matrices of the same order are commutative in 
multiplication with each other . 

10. If the diagonal elements of a diagonal matrix D are d {y 
prove that the matrix c 0 I +c 1 DAc^D 2 +->-+o k D k is diagonal, 
and that its diagonal elements are Co+Ci^-fCad* + 

8. Transposition : Interchange of Hows 
with Columns 

The code has now been described, and the formal rules 
of operation are almost complete. It has appeared that 
matrices, under suitable restrictions upon their orders, 
obey the following laws, 

A -\“B = JB ~{~A y A = {A -f-J?) 

A(B+0) = AB+AC, (B+C)A == BA +CA, { } 

which are the ordinary laws of elementary algebra, except 
for the omission of the commutative law of multiplica- 
tion. Hence, with due vigilance regarding the order of 
factors in product terms, wo may manipulate matrices in 
addition, subtraction and multiplication by the familiar 
rules. 

1. Expand in full (it+-jE?) 3 and {A— B) 8 . Note that the 
expansions are of eight terms, which must be left separate. 

2. Expand in full (A -~B)*(A+B), {A+B){A~B)\ 
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3. Writ© down a number of simple products like the above, 
expand them in full and contrast the results with those of 
ordinary algebra, where the commutative law of multiplication 
is valid. 

Transposition. There remains, however, an opera- 
tion which has no analogue in ordinary algebra. From a 
matrix A we may construct a new matrix the rows of 
which are the columns of A, and consequently the columns 
of which are the rows of A. This operation 'is called 
transposition , and the resulting matrix is called the 
transpose of A and is denoted by A'. (In the less recent 
literature the word conjugate is used, but this word is 
already heavily overtaxed in other domains of mathe- 
matics.) In symbols, A' [<%]' = [a H ], 


4. A matrix of order lxl, that is, a scalar number, is 
unaltered by transposition. 

5. The transpose of the column vector 

X = K X 2--- X r} 

is the row vector 

=» [®l »*..*■]• 


6 . 


If A = 


ci x b x 
% ^2 C 2 


Ug 

then A' = b x b 2 
- C 1 C 2 _ 


7. Form the products A A' and A'A f whore A is the matrix 
of Ex. 6, and remark on the nature of their diagonal elements. 
Consider the case of a general matrix A . 

8. If G = A A', observe that c i5 = c ji9 or C = (7. 

9. If x is a column vector, form x'x and observe that it is 
an ordinary number, a matrix of order lxl, and that it is 
the sum of squares of the elements of x. 

10. Examine in the same way xx\ and show that it is a 
square matrix possessing the property O' = C. 

11. Two operations of transposition restore the original 
matrix ; in symbols ( G ')' = C. The operation is thus 
reflexive. 

Symmetric and Skew Symmetric Matrices. This 
new operation enables us to give simple and compact 
definitions of important special types of matrices. For 
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example, if a matrix A is unaltered by transposition, it 
must be square and symmetrical about its principal diagonal, 
so that a H = a i5 . We therefore characterize a symmetric 
(or azisymmeiric) matrix by A' ~ A. 

If A f = —A, so that a H = — <%, the matrix A is called 
skew symmetric or antisymmetric . A skew symmetric matrix 
must necessarily have zero elements in tho principal 
diagonal, since a u = —a i£ . 

" a h g~ 

12. A = h b / is symmetric. 

Jff f c . 

r 0 a If 

13. A = —a 0 c is skew symmetric. 

“ — c 0 

14. A symmetric matrix of order nXn has in general 
n(n~bl)/2 distinct elements. 

15. A skew symmetric matrix of order nXn has in general 
n(n — 1)/2 distinct elements. 

9. Tlie Transpose of a Product : Reversal Rule 

Matrix multiplication is row-into-column ; in BA the 
rows of B are multiplied, element for element, into the 
columns of A. This being so, the transpose of BA cannot 
be B'A\ for that would imply columns of B multiplied 
into rows of A. To correct this we must proceed by not 
only transposing the factors, but also reversing their order, 
thus obtaining A'B'. This reversal ensures that the rows 
of B are duly multiplied into the columns of A, while 
transposition has also been effected. Thus (BA)' = A'B'. 

The reader, before going further, is asked to satisfy 
himself concerning this by constructing and inspecting 
actual examples, for this reversal rule in the transposition 
of products is of the greatest importance. We deduce from 
it step by step the general reversal rule : 

(CBAY = (BA)' O' = A'B'C\ 

(DOB A)' = (CBA)'D' = A'B'C'D and so on. 


(I) 
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Tla© Complex Conjugate of a Matrix. If the 
elements of A are complex numbers we denote their 
complex conjugates by d ti . The matrix [<%] is then 
called the complex conjugate of A, and is denoted by A. 

1. (X) = A. Also (Ay = (A'), denoted by A'. 

2. If a, b, e, d, e 9 f , g, h are all real numbers, and i — —1, 

and if 


VaA-ib 

I , then A' = 

~a—ib e—if ~ 

1 eA-if 

aA-ih T 

c—id g — ih~ 


Hermitian and Skew Hermitian Matrices. If 
A 9 — A the matrix A is square and is unchanged by the 
operations of transposition and taking complex con- 
jugates. Such a matrix is called Hermitian. On the other 
hand, if A ' = —A the matrix A, which is necessarily 
square, is called skew Hermitian , or anti-Hermitian. 


3. 

a “ bA-ic 

eA-if 


ia 

—bA-ic 

-e+if- 


b—ic d 

hA-ik 

and 

bA-ic 

id 

— h A~ik 


e—if h—ik 

g . 


eA-if 

h — J— ik 

ig . 


are respectively Hermitian and skew Hermitian (i — V — 1)- 

4. A Hermitian matrix which is real , that is, has exclusively 
real elements , is a real symmetric matrix. A skew Hermitian 
matrix with real dements is a real skew symmetric matrix . 
Hence all theorems on Hermitian matrices include theorems 
on symmetric matrices as special cases. 

5. If A is Hermitian, then iA is skew Hermitian. If 
A is skew Hermitian, then iA is Hermitian. 

6. If A is Hermitian, it can be written as R-\~iS, where M 
is a real symmetric and S is a real skew symmetric matrix. 

7. If A is a square matrix prove, by transposing, that 
A'+A is symmetric and that A' —A is skew symmetric. 
Also that A' A- A is Hermitian, A' — A skew Hermitian. 

8. Hence any square matrix A can be expressed as the 
sum of a symmetric and a skew' symmetric matrix, since 
A = i(A -{-AOd-KA — A') ; or as the sum of a Hermitian and 
a skew Hermitian matrix, since A = \(A A-A')A-$(A —A'). 

B . 
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9. If A is any matrix , not necessarily square , then A' A 
and A A' are both symmetric ; for by the reversal rule 
(A' A)' = A'(A'Y = A'A t and similarly for A A 

10. Prove in similar fashion that if A is any matrix with 
complex elements then A' A and A A' arc both Hermit Ian, 


11. Provo that matrices of tho form 


a 61 [ c dl 

~~h ay L ~~ d cj 


are commutative, and that thoir laws of addition and multi- 


plication arc congruent with those of ordinary complex 
numbers a+ib, c+id , whore i = V — 1 . Hero wo have a 


special matrix algebra in perfect correspondence, or iso- 
morphism, with a familiar algebra. 


10. Algebraic Expressions and Relations in 
Matrix Notation 

The rules of matrix algebra being now established, we 
are able to write many commonly occurring expressions and 
relations with extreme conciseness. The following examples 
may serve as an indication. Those which introduce the 
matrix notation for bilinear , quadratic and Hermitian forms 
deserve close attention. 

1. The sum of products, ajb x +a 2 6 2 + ... +a n 6 n , which m 
the typical element of a product matrix and which occurs 
also in questions of perpendicularity in coordinate geometry, in 
the differential of a function of several variables, and in a great 
variety of other situations in mathematics, may be written 
as a matrix of order lxl, namely a% a product of a row 
vector a r and a column vector 6, or equally well b'a. (The 
name inner product of vectors a and b is often used for tho 
above sura, but the notation a'b is precise, and calls for no 
special adjective.) 

2. The sum of squares xf+z * +...+#£ is therefore x'x. 
For example the square of tho distance, in Euclidean space, 
from the origin to a point having rectangular Cartesian co- 
ordinates {x x x 2 . * * x n} is The positive value of the square 
root of x'x is sometimes called tho norm of the vector x. 

3. The reader will verify for himself that an expression 
such as a ± i%i + a 22 x\ + * * > + &nn x n is the same as x'Ax, where 
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A is the diagonal matrix having elements a u . Also that 

is x'Ay, or y'Ax . See also 

Ex. 5 below. 

4. A set of equations of linear transformation may be written 
as the equality of two column vectors , thus : 


a ll x l + a 12 Vl * * * X 2 + * 

• J T a ln x n 


Vi 

a 21 X l + <*22^2 + * 

■ +<*2 n x n 



2/2 

a ml x l + a w2 £37 2 + • ' 



y m 


"Now the column vector on the rfght is simply y, and the 
column vector on the left is Ax, where A is of order mxn 
and x is of order n X 1. Hence the set of equations is Ax = y . 
In the same way a set of simultaneous equations can be 
written Ax = h, where h is the column vector of constants on 
the right of the equations. From now on we adopt such 
abbreviated notations. 

5. An expression such as 


ESatjX&j, 
i * 


» 8=8 1, 2, m, 

j = 1 , 2 , n , 


of the first degree in each of two sets of variables and y j9 
is called a bilinear form in those variables. We may write 
it in the shape 

Vl +<*122/2 + ••• +«1 nVn) +^2(%l2/l +%2?/ 2 + * * • + %n2/n) 

+ ••• + a? *n( a ?nl2/l +^w 22 / 2 +"* +<*mn2/»)* • • (2) 

Eut this is just x'Ay written in extenso , since the bracketed 

expressions are the respective elements of the column vector 
Ay. This, then, is the matrix notation for a bilinear form ; 
and A is said to be the matrix of the form. Further, since a 
matrix of order 1 x 1 is unaltered by transposition, we may 
transpose the product x'Ay and write the bilinear form 
alternatively as y'A'x. If A = I the bilinear form is 
x'ly = x'y, or y'x. 

6. If the above bilinear form be denoted by <j>, the partial 
derivatives d<f>/dx ly d<f>/dx 2 , d<j>/dx m are the respective 
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bracketed expressions in (2), and these are the elements of 
the column vector Ay. Thus wo may write 

d(x'Ay)/dx' — Ay. (3) 

Here d/dx' denotes an operation which may suitably be 
called vector differentiation ; its formal resemblance to ordinary 
scalar differentiation is evident. In the same way we may 
write 

d(x'Ay)/8y = x'A, . . . . (4) 

the vector of partial derivatives being now a row vector. 

7. In the bilinear form x' Ay, let x — y and lot A be 
symmetric. A' — A. The bilinear form then becomes 

a n xl + 2 a^XjXi + 2a t3 x 1 x 3 + ... +2a ln x 1 x n 
+ o 22 x\ 4-2a 23 K 2 :c a + .-- +2a 2 „r 2 a:„ 

+ • ( 5 ) 

+«»»*& 

or T j, where ci,j 

i * 

Such a form is called a quadratic form , of matrix A. Thus 
the matrix notation for a quadratic form is x'Ax, with the 
understanding that A' = A. If A = /, the quadratic form 
is x'lx = x'x. 

8. The equation of a central quadric with respect to the 
centre as origin, usually written 

ax 2 +by*+cz 2 +Zfyz+%9xz-ir2hxy = I, . . (6) 

will appear in matrix notation as x'Ax — I. Written in full 
this would be 

[x y z] Xa h g~\ r x‘ 

\ h b f y ~ 1, . . (7) 

1 a f c z 

which reproduces almost exactly a descriptive notation used 
by certain writers of the last century. 

9. The tangent plane to the quadric x'Ax = 1 at a point 
y (that is, column vector y of coordinates) upon it, or the 
polar plane with respect to a point y in the 3 -dimensional 

^Euclidean space, will be given by x'Ay = 1 or equally well 
y' Ax = 1. The notation is the same for the analogue of 
this in any number of dimensions. 
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10. The reader will verify for himself that the row vector 
d(x'Ax)/dx of partial derivatives of a quadratic form is 2x'A, 
and that the column vector d(x'Ax) jdx' is 2 Ax. Compare 
Ex. 6. 

11. Let a set of variables or vector x be given linearly in 
terms of other variables u by the linear equations x = Hu, 
and in the same way let y — Kv. Then substituting in the 
bilinear form x' Ay we obtain u'H'AKv , a new bilinear form 
of matrix H'AK. In the same way the transformation 
x = Hu causes the quadratic form x'Ax to become u'H'AHu, 
a quadratic form with matrix H'AH . The reader will easily 
show, by transposition, that H'AH is symmetric. 

12. A form x'Ax, where A is Hermitian, so that A' — A, 
is called a Hermitian form. Clearly if x = Hu then x'Ax 
becomes u'H'AHu, and by transposing and taking the 
complex conjugate of H'AH we observe that H'AH is 
Hermitian. Three types of transformation of A, namely, 
H'AK, H'AH and H'AH , have thus been induced by the 
linear transformation of the variables occurring in bilinear, 
quadratic and Hermitian forms. These are called the 
equivalent, the congruent and the conjunctive transformations 
of A. 

13 .A Hermitian form is real. For by transposing and 
taking the complex conjugate we have 

(x'Ax)' — x'A'(x'Y = x'Ax. 


Thus the form, a matrix of order 1x1 and therefore a scalar 
number, is equal to its complex conjugate ; hence it must 
be real. 


11 ; Partitioned Matrices : Partitioned 
Multiplication 

Submatrices. The array of elements belonging to 
(not necessarily consecutive) rows i v i 2 , ..., i r and columns 
j v j 2 , j 3 constitutes a submatrix of A of order rxs. 

In particular the elements of A are submatrices of order 
1x1, the rows of A are submatrices of order Ixn, and 
the columns of A arc submatrices of order mx 1. 
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It is a simple observation from the product rule 

$ij = 2h ik a kh . . ( 1 ) 

z« 


where O = BA , that c tj derives exclusively from the i ih 
row of B and the j th column of A. It follows at once that 
the submatrix consisting of rows i v i 2 , i r and columns 
jx> 3 & •**> 3& °f & will be the product of the submatrix 
consisting of these particular rows of B and the submatrix 
consisting of these particular columns of A . Thus 
submatrix in a product matrix is itself the product of two 
submatrices , one from each of the factor matrices, in 
ptoper order. 

If therefore we partition B by groups of rows into 
submatrices B lf B 2 , ..., ... in vertical array, and A by 

groups of columns into submatrices A v A & , ..., A $ 9 ... in 
horizontal array, then the ( i , j) th submatrix in BA , when 
J5J. is partitioned in row-groups exactly as B and in 
column-groups exactly as A , will be The following 

is a simple illustration, the partitioning being indicated by 
dotted lines : 



b%i a ii~^b 22 a 21 

^21^12 d“ ^22^22 t J n a i3 d bniftta 

' J S2" 2 i 

6 31 a n 4-6 32 a 2 i 

^31^12 d" ^32^22 ^3 1^13 'I 23 



We shall use a notation in which this result would 
appear as 



[ Ax A 2 ^ 3 ] — * 


m B x A x B X A % BjAi 
B,A X B, Z A a 2**4 J’ 


(3) 


submatrices being disposed like elements, the or 
letters indicating that they arcs submafriees. The left 
side of (3) may also be written as {B x B 2 } [ri x A 2 A z the 
intention of the curled brackets being the same m in the 
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notation of 4 (1) for column vectors. The analogy of (3) 
with the column-into-row product 


{^i ^ 2 } \. a i a 2 %] — 


^*1^1 ^1^2 ^1^3 
P2&1 ^ 2^2 ^ 2 ^ 3 . 


• ( 4 ) 


and the general result of like kind is evident. 

A still more general law of multiplication of partitioned 
matrices is possible. In the result (1) we may break up 
the range of summation of k into sub-ranges, writing for 
example 

n »j n a n 

£>b ik a kj = 2b iJ flM+ 2b ik a k j-\~ ... + ^b ih a ki , . ( 5 ) 

Tc = X &=1 ^ = + l 


or even using partial ranges which do not invo] re 
consecutive values of k ; for example if n = 6 using 1, 3, 
then 2, 5, then 4, 6 for three partial ranges. Now 1c is 
both a column suffix of B and a row suffix of A, and so 
this division into partial ranges corresponds to partitioning 
the columns of B into groups in a certain way, and the 
rows of A into groups in exactly the same way. This will 
be called conformable 'partitioning of B and A. We may 
interpret each partial sum of terms on the right of (5) 
as the typical element of a product of two submatrices, 
the first .being constituted by a certain group of columns 
of B, the second by the corresponding rows of A. Now 
combining this with the result typified by (3) we have such 
a result as 


B =* 


Bx 


$ 11 $12 $13 



$ 21 $22 $23 


■— [A 1 A 2 A 3 ] 


A U 

+ 13 

+ 13 

A 21 

+ 22 

+ 23 

- + 31 

+ 32 

+ 33. 


BA 



$11 An +-^ 12 + 21 +^ 13 + 
$21 A 11 A $22 A 21 + $23 A 


31 

31 


■®ll+12 + *^12+22 + -®13+82 /£ 

$ 2 lA 12 A $ 2 iA 22 + ^ 23+32 

■#11+ 13 + -®X2+23 + ^13+331 
#21 + 13 +^22+23 + *® 23*^33] 


where B and A have been conformably partitioned, and 
where the partitioning of the product BA agrees with the 
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row-partitioning of B and the column-partitioning of A 
The general result of this kind is proved in the same way' 
and its analogy with the ordinary rule of multiplication of 
matrices according to elements cannot fail to be observed. 
Formally we may enunciate it thus : 

Theorem of Multiplication of Partitioned Matrices. 
Let B and A be two conformably partitioned matrices. 
Further, let the rows of B be partitioned arbitrarily into 
groups, and the columns of A also arbitrarily. Denote the 
submatrices of B and A so arising by B {j , A iS , where i 
denotes the order of occurrence in row groups, j in column 
groups. Finally let the product BA, or C, be partitioned 
according to the row-partitioning of B and the column- 
partitioning of A. Then the (£, j) th submatrix of C is 

~ £B {k A kj . . . • ( 7 ) 

In the case where all the submatrices are elements of 
B and A this result becomes the ordinary multiplication 
theorem for matrices. 

The reader is again desired not to proceed until he has 
grasped, by examples both literal and numerical, and by 
examining the steps of a general proof along the lines we 
have indicated, the full implication of the multiplication 
theorem of partitioned matrices. 

1. Consider the partitioned product 

BA = [Uj B Z ...B„] {-dj A Z ...A„} 

= B k A x + B Z A 2 + ... -f- B n A 

where the submatricos of B aro its n columns, and of A its n 
rows. Examine the products B k A k . Observe that t hey are 
all matrices of order tnxp, and that by superimposing 'them 
in matrix addition wo have BA in its usual form. 

2. In Ex. 1 wo have, as it wore, regarded B as a row 
vector having column vectors for its elements, and A as H 
column vector having row vectors for its elements. Hut we 
may equally well partition B according to its rows, regarding 
it, as it were, as a column vector having row vectors" for its 
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elements, while partitioning A by its columns. We then 
have the product 

ByAy *•* DyA^ 

B%Ay BgAt), ... B^A^ 

BA = {B 1 B z ...BJ[A l A t ...AA = 


I B m Ay B m A 2 .-B m A 

where for example 

B { A 3 - = [6 tl b i2 ...b in ] {a lt a 2j ...a nj } 

= b il a lj +b i2 a 2J +... +b in a nj . 



This way of partitioning draws attention once again to the 
fact that the elements of BA are row -into -column vector 
products. Exs. 1 and 2 of 6 now appear as simple examples 
of partitioning ; also Ex. 1 of 7. 


The algebra of matrices still lacks one important 
operation. Division, or the operation that plays the part 
of division, has yet to be defined. We should like to 
discover matrices reciprocal to A , that is, matrices B 
and S such that BA = I, AS = I. If these can be found 
it will be possible to solve a set of equations Ax = h by 
simply premultiplying by B, for this would give RAx = Rh 
or x = Rh, since BA = I. The key to the definition of a 
reciprocal matrix is therefore to be sought in the solution 
of simultaneous equations, and is found in that form of 
the solution which makes use of determinants. This will 
be the subject of the next chapter. 

3. Let J bo defined as a matrix of order n xn in which the 
non -zero elements are units in the diagonal at right angles to 
the principal diagonal, that is, the secondary diagonal. Thus 
a if n -i+! = 1, a i5 = 0, i~\~j # w + 1- For example, if n = 3 



Write out in full some products JA , where A is a rectangular 
matrix, and describe in words the effect on A. Do the same 
with products of the type AJ. Note that J ' = J, J 2 = I. 
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4. Examine and describe in words th© effect on a 
rectangular matrix A of the operation J X AJ % , where J x and 
J 2 are of the type J. 

5. Lot V bo defined as a matrix of order n x n in which all 

elements are zero except those in the diagonal of n — 1 places 
immediately above the principal diagonal (that is, the super - 
diagonal ), these being units. Thus = 1, a is = 0, 

j—i 1. For example, if n = 3 


U » 



Examine some products of type UA, U 2 A, U Z A, ... ; 
AU, AU\ ... ; U'A, ( U')*A , ... ; AU\ A(U')\ ... ; and so 
on. Note the effect on A of these various operations. 

6. Do the same with UAU, U'A U, UA (J\ U'A U\ 
where A is square ; and with the corresponding cases where 
A is rectangular and pro- and postmulti plying matrices are 
of different orders. 

7. Examine the powers of U, proving that U n = 0. Also 
evaluate UU\ U'U. 

8. Prove that polynomials in A , such as 

a 0 I + a x A + ... 

are commutative in multiplication. 

9. The sum of all dements in a general matrix A is equal 
to uAx, where all the elements of u and x are units. 

10. Prove that if A and B are such that A B and BA 
coexist, then AB and BA have the same sum of diagonal 
elements. Establish a similar property for ABO, BOA, 
CAB , and conjecture and prove a general theorem. 

11. Let H be a matrix derived by permuting in any way 
the rows of tho unit matrix JT. Then H ' will be derived from 
I by the same permutation upon columns. Form H'H and 
HH\ Inspect examples and establish a general theorem. 
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12. The Solution of Simultaneous Equations 

Let us consider the elementary solution of simultaneous 
equations in 1, 2 and 3 unknowns. 

(i) The solution of a n a? x = 7i x . „ . (1) 

is x 1 = A 1 /a 11) provided that a 13L ^ 0. . (2) 

(ii) The solution of (ixv c x J r a i 2 x % = hi> /o\ 

« 21 ^ 1 +« 22^2 1=5 ^ 2 ’ 

found by elimination first of x 2 , then of x v is 

X 1 ^ (^1 a 22 ^2 a i2)/( a U a 22 a l2 a 2l)> 

X 2 = (^2 a ll~~~^l a 2l)/( a li a 22~ a 12 a 2l)f * 


provided that the denominator a 11 a 2 2 "~^i 2 a 2 i> which is the 
same for both x x and x 2) does not vanish. 

The expressions in the numerator and denominator of 
both solutions arise from the familiar procedure of cross- 
multiplication to bring about elimination, for example 


a n V ✓ a i2 

Y yielding a n a 2 . 2 ~a^a 21 . . (5) 

a 21 


The general determinant, we shall find, is simply the 
extension of such a cross-product as the above from a 
2x2 matrix to an wxw matrix. 


27 



28 


PROPERTIES OF DETERMINANT 


(iii) Solving in the same way the equations 

°n x i +% 2 :r 2 + a 13 a; 3 = K 

a 2i x i+a 22 x 2 +a 22 x 3 = A 2 , (6) 

by elimination of x 2 and & 3 , we obtain x x in the form of 
a quotient the denominator of which is the six-termed 
expression 

a — «a 13 fif 22 <2 31 , (7) 

the numerator being another six-termed expression obtained 
from the above by substituting h v h 2i h z respectively for 
a u> a 2 i> a 3 i* Again x 2 and x z are quotients of similar 
nature with the same denominator as x v and this 
denominator must not vanish. 

The solution of four simultaneous equations may be 
carried out in the same way by the various necessary 
eliminations ; the denominator of the expressions for 
x Xi x 2 , x 3 , x l is then found to be the same in each ease, 
namely an expression of 24 terms, 12 prefixed by positive 
and 12 by negative sign, each term being a product of 
four elements a ij7 no two of which in any term have the 
same first suffix or the same second suffix. The numerators 
are also found to be expressions of 24 terms. 

It is natural to pause here, and instead of proceeding 
to the further heavy eliminations necessary for solving 
simultaneous equations in 5 or more unknowns, to study 
the properties of these numerators and denominators and 
thus by induction to define them generally, m the hope 
of solving the set of n equations in n unknowns and 
incidentally of finding the reciprocal matrix of a given 
matrix. The expressions constructed by this synthetic 
method are in fact determinants . The old name for them, 
eliminants , faithfully reflects their historical origin. 

13. Salient Properties of Eliminants 

The properties likely to be useful for purposes of 
definition emerge already in the examples we have given. 



EVEN AND ODDVBERMUtAXiONS 

The respective denominator! a xl) a lx a % ^a^ 21 ^md 
the longer expression (12, (7)% <ffl Ikterms, presumably oj 
nl terms in general. For n > nk s^^rently half of tiie 
terms are positive and half are 

consists of a product of n factors a ij} an<fifirhhes€ pruuucts 
all the first suffixes, 1, 2, ..., n and all the second suffixes 
1, 2, ..., n appear once and once only. Hence if we order 
the factors in a teffri so that the first suffixes, which are 
really row suffixes of elements in the related square 
matrix A, are in natural order 123... n, then the second 
suffixes will be some permutation a/3 y...v of that natural 
order. 

We are therefore led to define a determinant of order n 
as the following function of the elements a i5 of a square 
matrix A: „ . 

^rn a ia&2 ft ••• a nv i • • * (1) 

the summation being extended over all the possible nl 
permutations of natural order of second suffixes a, /3, v. 

It remains, however, to discover the rule by which the 
sign 4- or — is prefixed to any given term. This rule 
depends on the nature or class of the permutation (aj3...v) 
of (12... n) ; and this requires a study of the classification 
of permutations into even and odd classes. 

14. Inversions, Transpositions, Even and 
Odd Permutations 

When two indices in a permutation are out of natural 
order, the greater index preceding the lesser, like the 3 
and the 2 in (1324), or the 5 and the 1 in (52341), such 
a derangement is called an inversion . 

The interchange of two indices in a permutation without 
alteration of the rest, as when (13425) becomes (13524) 
by interchange of 4 with 5, is called a transposition. 

The number of inversions of natural order in any 
permutation (a/3...v) can be counted systematically and is 
unique. A permutation is said to be even . or of men class , 
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when the number of inversions in it is even, and odd, 
or of odd class , when the number of inversions is odd. 


15. Definition of Determinant 

We are now able to define the determinant \A\ or \a ti \ 
of a square matrix A of order nxn thus : 

\A\==\a ii \?=2±a la a i p ... a nv , . . (1) 

the summation, of nl terms, being extended over all 
permutations (a/?...v) of second or column suffixes of the 
elements a {j , and the sign -f or — being prefixed to any 
term according as the permutation is even or odd. 

Permanent. The corresponding sum with terms all 
prefixed by the positive sign is called the permanent of A ; 
its properties are neither so simple nor so rich in application 
as those of determinants, but it has an importance in the 
theory of symmetric functions and in abstract algebra. 

-r -*• 

We shall denote it by \A\, 

Notation of Determinants. The early writers, 
Cauchy, Jacobi and others, denoted \A \ as we have done 
above, by 

2 — a nv 


or some similar notation. Cayley in 1841 introduced what 
has remained, for nearly a century, the standard notation 


^11 *** 
a %\ a 22 * • * C hn 


( 2 ) 


a nl a n2 *** a nn 


Other notations are ]a 11 a 22 ...a n »|, the determinant being 
indicated by its diagonal elements ; or the single suffix 
notation | a 1 b 2 ...k n \, distinct letters being used to denote 
column order. Vandermonde, Sylvester and other writers 
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used a so-called umbral notation, in which the letter a is 
discarded and the suffixes themselves are made pre- 
dominant. In this notation 


/1234N /2134\ /2134A 
Vl234y, \1234/ , \4231 / 


(3) 


would represent respectively the general determinant of 
the 4th order, the same with first and second rows inter- 
changed, and the same with first and second rows and 
first and fourth columns interchanged. 

The practice in the present book will be for the most 
part to use | A \ or \a if \ 9 leaving the order to be understood 
from the context ; and we shall also indicate the 
determinant of a partitioned matrix by using the typical 
vertical bars of Cayley's notation ; thus, 


Ai 

Bx 


or 


^■1 ^2 

B 1 B 2 


• (4) 


Other notations, in particular the one by diagonal 
elements (which however fails to distinguish between the 
determinant of A and that of A') will be used as occasion 
may require. 

A few easy examples will serve to illustrate the classing 
of permutations by counting inversions, and the formation 
of determinants according to the definition. 

1. To count the number of inversions in a permutation. 
Take for example (43521). The inversions are 43, 42, 41 ; 
32, 31 ; 52, 51 ; and 21. In all, 8 inversions. The class of 
(43521) is therefore even, and the term (iu a zz a zz a M a $i in a 
determinant |/I | = | a n a n a zz a M a 5S ) of the 5th order would 
receive the positive sign. 

2. The mntibor of inversions may be counted by the aid 
of a diagram thus. Let the indices be written in an upper 
line, spaced out and in natural order, for example 1 2 3 4 5. 
Directly below them write in the same way the permutation 
in question, and then join each index in the upper line to the 
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same index in the lower line, curving the joins if necessary so 
that all intersections are of two joining lines and not more. 
For example : 

1 2 3 4 5 



4 3 5 2 ] 


Here, since 4 and 3 are out of natural order in the lower 
line, the joins of 4 with 4 and 3 with 3 must intersect. The 
same is true for each inversion that is present in the lower 
line ; hence the number, as here eight, of intersections of 
the joining lines is equal to the number of inversions. 

The reader is invited to try further examples. The 
diagram is merely a convenient device for giving visual form 
to the other method of counting inversions. 

3. In the ways just indicated we may class the orders 
(123), (132), (231), (213), (312) and (321) as even and odd 
alternately. Hence the determinant \A\ of the 3rd order is 

^11^22^ 33 — ^11^2 3^ 32 "h ^12® 2 3^ 31 2 tt 21 a 3 3 “h 3^21^ #2 — a l Z a tZ a Zl i * ( 

as has appeared already (12, (7)) in the denominator of the 
solution of three simultaneous equations. 

Effect of Transposition. A single transposition of 
order changes the class of a permutation . 

Consider (aja^a^a^aQ). Suppose a 2 and a 5 are inter- 
changed. This may be done by passing a 2 over each of 
a 35 a a 6 in turn, let us say in general over m indices, and 
then passing a 5 back over a 4 , a 3 , in general over m—1 
indices. Now each passing over a single index either 
introduces an inversion or removes one, according as the 
two indices concerned were or were not previously in 
natural order ; and in either case the class must be changed. 
Hence we have in all an odd number, 2m— 1, of changes of 
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class ; so that a transposition changes the class of a 
permutation. 

It follows that the difference between the number 
of inversions introduced and the number removed by a 
transposition must be an odd number. 

Conjugate Permutations. Let us consider (3241) and 
(4213), and let us distinguish index, and place occupied , 
as follows. In (3241) the index 3 occupies the first place, 
place 1 ; in (4213) the index 1 occupies the third place, 
place 3. If the reader will examine the other indices in 
these two permutations, and the places they occupy in 
each, he will note that the numbers expressing index, and 
place occupied, in (3241) are exactly the numbers expressing 
place occupied, and index, in (4213). Two permutations 
in which the rdles played by index, and place occupied, are 
interchanged in this way are called conjugate permutations. 

It is clear that a second interchange restores the original 
position ; in other words the conjugate of a conjugate 
-permutation is the original permutation. It is possible, 
too, for a permutation to be its own conjugate, or self- 
conjugate ; the reader will verify, for example, that (13254) 
is self- conjugate. 

4. Consider the diagram used in classing permutations. 
The upper line of numbers in natural order is simply what wo 
have called place occupied ; the lower line is what we have 
called index. Hence if we write the natural order such as 
1 2 3 4 below the diagram of crossed lines and then complete 
the top line, the top line will give the conjugate permutation. 
1 2 3 4 4 2 1 3 



3 2 4 1 1 2 3 4 


C 
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5. It follows at one© that the diagram of a self-con jugate 
permutation must be symmetrical about its horizontal bisector. 


12 3 4 



6. If any term of a determinant such as a la a 2 fi . . a nv 
has its factors reordered so that column suffixes a/?. , . u come 
into natural order, the term then becoming a a ' x . . . a v r n 
then the resulting permutation of row suffixes (a'/T... j/) is 
the conjugate of the former permutation of column 

suffixes. 

The reason is that such a reordering is precisely the 
interchange of index with place occupied. 


16. Identity of Glass of Conjugate Permutations 

There is a simple but important theorem, namely, 
that conjugate permutations have the same number of inver- 
sions of natural order. 

Consider the diagrams used for counting inversions by 
the number of intersections, as exemplified in 15 (7). The 
diagrams are the same. Hence the number of inter- 
sections, and so of inversions, is the same ; and so conjugate 
permutations are of the same class . 

Now to rearrange factors in each term of a determinant 
so that column suffixes, not row suffixes, are in natural 
order, is the same as to define the determinant in the old 
way but with respect to a matrix obtained from A by inter- 
changing rows and columns , and this matrix is of course 
A'. Since, by 15, Ex. 6 and the theorem just proved, all 
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terms in such an interchange retain the same sign as 
before, we have the fundamental theorem : 

The determinants of A and A ' are the same. 

Example 

Write out in full the six terms of the determinant | A | 
of the 3rd order. Rearrange the factors in each term so that 
column suffixes are always in the order (123). Then put 
B = A', expand \B\ according to uhe original rule and compare 
the results. 

Relative Class of Two Permutations. To decide 
whether two permutations of (12 ...n) are of the same or 
opposite class we need not count the inversions of each 
with respect to (12...rc) ; it is sufficient to count their 
relative inversions with respect to each other.* For example 
(3142) and (4213). 

3 14 2 



Here there are five intersections, so that five transpositions 
of contiguous indices are required to remove the relative 
inversions. (The moves are 3 with I, 3 with 4, 3 with 2 ; 
1 with 4, 1 with 2 ; these make 3142 become 4213.) 
Hence the permutations are of opposite class. 

It follows that the sign of a term in the expansion of a 
determinant can be decided, not only by the number of 
inversions of column suffixes when row suffixes are in 
natural order, or by the number of inversions of row suffixes 
when column suffixes are in natural order, but also by the 
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relative inversions of row with, respect to column suffixes 
when the factors of the term are in any order. 

Our example thus shows that the term & 34 a 12 a 41 a 23 , 
as likewise the term a iZ a 21 a u a Z2 in a determinant \A ] 
of the 4th order, would receive the negative sign. 

Permutations have many other properties, but those 
we have just given are sufficient for developing various 
simple properties of determinants, which are of use in 
evaluating them by expansion in various ways. 


17. Elementary Properties ol Determinants 

1. As we have seen, an immediate consequence of the 
fact that conjugate permutations have the same class is 
that \A'\ = \A\. 

2. Let columns j and h of A be interchanged. The 
natural order of row suffixes is not altered, but in all the 
terms of the determinant of the matrix B obtained in this 
way from A the column suffixes receive one transposition, 
namely, j with Jc . Hence, as compared with the same 
terms in \A\, the terms have undergone one change of 
sign. It follows that |2?| *= — \A\. 


1 . 



—5 2 

7 3 


— 29 . 


3. Since \A'\ = \A |, it follows at once that interchange 
of two rows in \A \ produces a determinant \B\ = ~~|A|. 


2 . 



3 

2 



—29. 


4. If two rows, or two columns of A are identical, then 
\A | is unaltered by their interchange ; yet, as we have 
just seen, the new determinant is —\A\. Hence 
\A \ = —\A\, so that \A \ = 0. Thus a determinant with 
two rows or two columns identical vanishes . 

5. For n> 1, a determinant \A \ possesses nl/2 positively 
signed terms and n\ j2 negatively signed terms. (Deter- 
minantal sign due to permutation is meant, not the 



PRIMENESS OF DETERMINANT 


37 


adventitious sign that a term may have because a i5 happens, 
for instance, in one case to be 7 and in another to be —5.) 

For let us put every = 1. Since all rows, or 
columns, are now identical we have | A | = 0. Also every 
term in the expansion of \A\ must be numerically equal 
to 1. Hence half of the terms are positive, half negative. 

The following proof is also instructive. Of the n\ 
permutations, n> 1, suppose m are of even class. Sub- 
jecting these to a given transposition, such as the inter- 
change of the index in the first place with that in the 
last, we obtain m permutations all different, as is readily 
seen, and all of odd class. Hence the number of permuta- 
tions of odd class must be equal to or greater than the 
number of even class. By exactly similar reasoning the 
number of even class must be equal to or greater than the 
number of odd class. Hence even and odd permutations, 
for n>l, are equinumerous. 

6. The determinant of a diagonal matrix, since all 
non-diagonal elements are zero, reduces to a single term, 
namely, the product a n a 22 ...a nn of the diagonal elements. 

In particular the determinant of the unit matrix I is 
equal to 1, the determinant of a square zero matrix is 
equal to zero, and the determinant of XI is equal to A w . 

7. If every a i5 be changed to A<%, every term in the 
determinant | A ( is multiplied by A n . Hence \XA | = A n \A |. 
In particular |— A\ = (— ) n |A|, |— 1\ = (— l) rt . 


^ 18. Primeness of a Determinant 

A determinant, regarded as a multilinear polynomial 
in its elements (that is, linear in any one element), 
cannot be factorized into factors rational for all values 
of the In other words it is a prime polynomial in its 
elements. 

For let us assume on the contrary that it can be 
factorized into two polynomials 6 , </>. Suppose that 6 
contains a certain element a i5 ; then from the definition 
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of | A | it is clear that <f> can contain no elements with row 
suffix i or column suffix j. Suppose again that contains 
a certain element a rs ; then in the same way 0 can contain 
no element with row suffix r or column suffix s. But this 
means that the elements a is and a rj can occur in neither 
factor, which is absurd, since they certainty occur in \A\. 
Hence the assumption is false, and \A | is irreducible into 
polynomial factors. 

19. Various Modes of Expansion of a Determinant 

Expansion of | A | by Elements of a How or Column. 

What is the coefficient in ] A | of the leading element a n ? 
Evidently it consists of all the terms 

a nv > • • (f) 

and since the row and column suffixes 1, 1 in a n are in 
correct natural order they make no contribution to the 
sign of any term in (1). This sign is therefore given by the 
permutation (fty.'.v) of the natural order ( 23 ...T&). 

For example, the coefficient of a u in |<%& 2 2 a 33 1 is 

Now the aggregate of terms in (1) above, (n—1)! in 
number, constitutes by definition the determinant of order 

n — 1 , 


a 22 

a 23 ... 

a 2 n 

a Z2 

U 33 ... 


a n2 

a n3 ... 

®nn 


obtained by deleting from | A | its first row and first column. 
We shall denote this determinant by \A n \. 

What is the coefficient of % in the expansion of \A \ ? 
In | A\ let us pass the i ih row upwards over the others 
until it becomes the first row, and then let us pass the j th 
column to the left over the others until it becomes the 
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first column. The effect of this is to make a i6 become the 
leading element in a determinant derived from \A\ by 
(i— l)+(j— 1) changes of sign; and the coefficient of a u 
in the expansion of this determinant is the determinant 
of order n—l obtained by deleting the row and column 
containing a u . It follows that the coefficient or cofactor 
of a y in |i| is the determinant obtained by suppressing 
the i ih row and the j th column of \A\ and giving the 
sign (— )<+*-*, that is, (—) <+i or equally well (-—)*-* 
or (— 

We shall denote this cofactor of in \A\ by \A tj \. 
It could also be denoted by d\A\jda if . The determinant 
obtained from j A | by simply suppressing the i th row and 
j th column, without giving sign (—)*+*, is called a minor 
of \A | ; and so we shall sometimes call a cofactor a 
signed minor. The determinant obtained by suppressing 
m rows and m columns of \A\ is called a minor of 
order n—m. 

Now in the expression 


«uMul+«uMul+- +a ln \A m\ • (3) 

there are altogether {n — \)\n terms, that is, n\ terms, 
all different and all occurring with correct sign in \A |. 
But the expansion of | A | itself has only n\ terms. Hence 
the expansion (3) is a complete expansion of \A\. It will 
be called the expansion of | A | according to elements of 
the first row and their cofactors. 

In exactly the same way, since we can bring any row 
into first place with no more than a change of sign in \A |, 
we see that there is an expansion of \A\ according to 
elements of any row, namely, 

\A\ = a il \A il \+a i2 \A i z\A--->+&in\Ain\^ . (4) 

and further, since \A'\ = \A j, according to elements of any 
column and their cofactors. 
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The following examples give useful reductions and 
transformations of determinants, depending on the 
properties thus far derived. 


1. 


a i ft 

ft ^2 

^8 ft C 3 


a x b * C * -b, 02 Ca 
ft c 3 cl 3 c 8 


+<k 


a> 2 b 2 
& 3 ft 


*-<h 


ft c 2 

ft c 3 


~ct 2 


ft °i 


-j-a 8 


ft Ci 
ft c a 


and four other similar expansions. 


2 . 


1 -3 7 

-2 4 5 

3 1 -1 


= 1 


4 5 

1 -1 


-(-3) 


-2 5 
3 -1 


= —9 + 3 ( — 13) +7 ( — 14) 
«= -146. 


+ 7 



The reader should evaluate this determinant according to 
elements of rows and of columns and cofactors in all the other 
possible ways, and should invent and work through many 
other numerical examples of the same kind. 


3. 


a x . 

®2 ft ^2 ft 

#3 ft C 3 ft 
ft £4 d 4 


% 


ft ft 
ft c s ft 

64 c 4 d 4 


- (5) 


by expanding the determinant on the left in terms of its 
first row. 


1% + Aa! ft + Aft Ci + A/i 


a l ft C l 


<*1 ft n 

j a 2 ft C 2 

= 

I ^2 ft c 2 

+A j 

j ft c 2 

1 ft C 3 


! ^3 ft C3 


i # 3 ft ^8 


again by expanding in terms of the first row. 

5. In the above, put [a x ft 7 J == [ > a ft c z ] or = [a z ft c 3 ]. 
Then the second term on the right involves a determinant 
with two rows identical and so vanishes. Hence the important 
result : 


A determinant is unaltered in value when to any rota , or 
column, is added a constant multiple of any other row , 
or column. 
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6. Evaluate the determinant of Ex. 2 by using this fact. 
One way is as follows. Perform on the determinant the 
operations row 2 -f-2 row 1? and then row 3 — 3 row 19 the row 
to be modified being always written first in these operations. 
Then expand in terms of the first column. T his gives 


1 —3 7 

0 —2 19 

0 10 -22 


—2 19 

10 -22 


-146. 


7. Perform on the same determinant of Ex. 2 the operations 
col 2 -f3 col x , col 3 -7 col x , and then evaluate as before. 


A a x A b x Xc x 


Xa x izb x vc x 

; ' 1 

a x a 2 a 3 

fj.a 2 jjib 2 f±c 2 

= 

Xa 2 fzb 2 vc 2 

II 

b x b 2 b 3 

va 3 vb 3 vc 3 


Xa 3 fjth 3 vc 3 

1 ! 

Ci Co Co 


9. By means of the results (i) \XA\ = \ n \A\ and 
(ii) \A '[ = \A \ prove that the determinant of a skew symmetric 
matrix S of odd order vanishes identically, and that the 
determinant of a skew Hermitian matrix of odd order is a 
purely imaginary complex number. 

(In the first case \S\ — — \S\, in the second \S\ = — \S\.) 


10 . 


1 1 1 
a b c 
a 2 b 2 c* 


= (c— 6)(c — a)(b — a). , 


(8) 


(By expansion in terms of any column, the determinant is 
seen to be a polynomial in a, or 6, or c. Further, if b — a, 
or c = a, or b — c, the determinant has two columns equal 
and so vanishes. Hence by the Remainder Theorem it con- 
tains ( b —a), (c —a) and ( 0 — 6) as factors. But the term arising 
from the principal diagonal, the leading term, is be 2 . Hence 
the remaining factor is purely numerical and is equal to unity.) 

11. The determinant of Ex. 10 is a case of an alternant , 
so called because to interchange any two of a , 6, c is to inter- 
change two columns and therefore to alter the sign of the 
polynomial. If we write this alternant as ]a° b l c 2 |, the reader 
will prove similarly that the general alternant |a°6 1 c 2 ...& n ~ 1 | 
is equal to the continued product of all the differences that 
can be formed from the £w(n — 1) pairs of letters taken from 
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a , b 9 k, the alphabetical order being reversed in each 
pair. The polynomial so derived is called the difference- 
product of a 9 b, 1c and will be denoted by A {a, b 9 

12. | a 1 £> 2 ...& n | = ab...JcA(a 9 6, ... k) — A(0 9 a 9 6, ..., k). 

13. \a l b 2 ...k n \ is the alternant |a 1 6 2 ...& n |, with indices 
lowered into suffixes. In fact a determinant may be regarded 
as the result of a symbolic operation of difference-product 
type, in the sense that if E a is an operator defined by 
E a a { = a i+v and E h9 ..., E k are similar independent operators, 
then 

A(0, E a9 Efo ..., E k ) a 0 b 0 ...Ic 0 = 


14. If we write the general alternant as 

.0 


K°2 


.. a 


, 71-1 


we may note that the element in the i th row and j ih column is 
a*" 1 . Hence an alternant matrix is briefly characterized by 

[a j" 1 ], and its transpose by 


a ll a i2 

... a ln 

* <2 22 

... a 2n 

’ ’ . 

. . . ci nn 


a lX a 22 ‘" a nnt 


. . (9) 


where = 0, i>j. (All elements below diagonal are zero.) 
To prove this, expand in terms of the first column, and repeat 
the operation on the determinants of lower order so obtained. 
Alternatively, note that any term involving elements from 
above the diagonal must necessarily also involve elements 
from below the diagonal, and these are all zero. Thus only 
the diagonal or leading term survives. 


Length, Area, Volume and Hypervolume as 
Determinants. In the following examples we develop 
in determinantal form the length of a straight line segment, 
the area of a plane triangle, the volume of a tetrahedron 
and the hypervolumes of the successive analogues of these 
figures in space of higher dimensions. 
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16. The line segment. If A is on a straight axis OX at 
distance x x from the origin 0, and B is at distance the 
signed length BA is x 1 —x 2 , and this is equal to 


x i 

X 2 


1 x-, —a 

, or 1 
1 x z —a 


1 

1 • 


( 10 ) 


the second of these expressing the fact that the length is 
invariant under change of origin from x = 0 to x = a. If B 
is the origin, x 2 = 0 and the length is merely x x . 

17. The triangle. Let the reader draw a triangle ABC in 
the first quadrant XOY of ordinary rectangular axes, and 
perpendiculars from A = (x l9 y x ) t B = (a? 2 , y 2 ), C = (x 3f y «) to 
OX to meet OX in A\ O'. Then, with due regard for 
sign, as given by direction of base segments, the area ABC 
is seen to be the sum of the areas of trapezia on bases B'A', 
A'C', C'B', the first trapezium for example being B'A'AB. 
Since the area of a trapezium is the length of the base multi- 
plied by the mean of the extreme ordinates, the area ABC 
is by (10) 


i { (Vi+Vi) 


x 1 1 
x 2 1 


+(y% + 2 / 3 ) x 2 ! 


+{y* + 2 / 1 ) 


511 } 



*1 1 2/2 +2/3 


*1 2/1 1 

= i 

^2 1 2/3+2/! 

= i 

x t y 3 1 


*3 1 2/1 +2/2 


y 3 1 


the last determinant being obtained from the one on the left 
of it by the operations ( 2 / 1 + 2 / 2 + 2 / 3 ) c °l 2 > col 3 — col 2 , cancel 
3/i + 2 / 2 +2/s from col 2 again and rearrange columns. 

The determinant (11) is easily seen to be unaltered if each 
x t is replaced by a*— a, each y t by the invariance of area 

under change of origin being thus expressed. Note also that 
if C is the origin (x 3 , y 3 ) = (0, 0) and the last row and column 
of the determinant in (11) may then be deleted. 

18. The tetrahedron. With a similar naming of points, 
let the reader draw a tetrahedron in the first octant OXYZ, 
and perpendiculars from A, B, C, D to the plane XOY to 
meet it in A', B', C', D'. (In a first drawing he may lot A' 
fall within the triangle B'C'D'. In later drawings I 10 should 
consider varied orientations.) Then the volume of the 
tetrahedron is seen to be the sum of signed volumes of prisms 
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on triangular bases B'C'A', C'A'D', A'D'B', D'B'C'. Since 
the volume of a right prism is the base area multiplied by 
the mean of the vertical edges, the volume of ABCD is by (11) 


Xi y x 1 z 2 +z 3 +z 4 »i Vi z i 1 

i x 2 y 2 l z z +z 4 +z x x 2 y 2 z 2 l 

t x 3 y 3 1 z^+^i +^2 t x 5 y 3 z s l 9 
x 4 2/4 1 *i 4-^2 +Zs ^4 2/4 ^4 1 


(12) 


the last determinant being obtained from the one on the left 
of it by the operations (z 1 +Z 2 +z 8 +z 4 ) col 3 , C 0 I 4 —C 0 I 3 , cancel 
2 ^ 4 - 22 +Z 3 +Z 4 from col 3 and rearrange columns. 

The form of the general result is now plain. The 
determinant occurring in it is of order n - f 1 and of the type of 
(12), and the outer factor is 1 jn\. If the last corner is the 
origin, all elements of the last row except the final unit are 
zero, and the determinant reduces to one of the n th order, like 
that obtained by deleting the last row and column of (12). 
It is also clearly invariant under a change of origin from 
(0, 0, ..., 0) to (a, 6, ..., h). 

In the corresponding figures of completed type, the line 
segment, the parallelogram (sum of two equal triangles), the 
parallelehedron (sum of six equal tetrahedra) and so on, the 
length, area, volume and hypervolumes are expressed by the 
same determinant, but without the factor 1 jn\. Those 
determinantal forms are valuable in the discussion of 
differential elements of volume in space of 2, 3 or more 
dimensions. 


Elementary Operations on Determinants. The 
following examples are important as showing that 
certain commonly occurring elementary operations on 
determinants, such as interchanging or permuting their 
rows or columns, replacing a row (or column) by linear 
combinations of rows (or columns) and the like, can be 
effected by pre- or postmultiplication of the matrix A 
of the determinant by suitable matrices. 

19. Interchanging two rows. Consider 

• 1 . cq Cj cl g 

1 • ci 2 £> 2 ^2 ~ Ci . p » (1*1) 

.* • 1. b 3 Cg^ Ctg 63 CjJ 
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Premultiplication of A by a matrix obtained by interchanging 
the first two rows of I has effected the interchange of the first 
two rows of A . The reader will prove that in general any 
permutation of the rows of a matrix A can be effected by 
premultiplying A by a matrix derived from I by the same 
permutation of rows. Compare 11, Ex. 3. 

20. Any permutation of the columns of A can be effected 
by postmultiplying A by a matrix derived from I by the 
same permutation of columns. This is simply the result of 
Ex. 19 transposed. 

21. 1 A . Gq b ^ c{\ H f" A &2 “b ACg - ! 

• 1 * ^2 C 2 I ^ j ^2 I* (14) 

. • • 1. . a 3 ^3 c 3j L <*, b 3 C 3 J 

Thus operations such as row^-j-Arow* are effected by a 
premultiplication by I with a A inserted in the ( i , Jc) th position. 
The reader will at once see the suitable premultiplication for 
row^+AroWfc-bju, row fe , and so on. 

22. By transposing Ex. 21 we see the suitable postmulti- 
plications that will effect eoly+A col^+ju, col fc and the like. 

23. Observe that 



and show in general that any matrix with units in the principal 
diagonal and all elements zero on one side of the diagonal 
can be expressed as a product of matrices representing 
elementary operations on rows or columns. 

24. From expansion by a row it follows at once that if all 
elements in a row of \A\ are changed in sign the resulting 
determinant is equal to — \A\. A useful consequence is 
that if the 2nd, 4th, 6th, ...rows and the 2nd, 4th, 6th,... 
columns of \A\ are changed in sign the determinant is un- 
altered in value. In a brief notation |( — ) <+ *a w | = \a tJ \, 

20. Practical Evaluation of Determinants 
by Condensation 

The method of expanding a determinant according to 
elements of a row or column is in general of practical 
service only when the determinant is of no higher than the 
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3rd order. For higher orders the process has to be repeated, 
and the number of terms soon becomes very large. For 
this reason less cumbrous methods based on condensation , 
that is, on the systematic reduction of a determinant to one 
of lower order, are much superior in practice. Most of them 
are variants of a method ascribed to Chid (1853), but 
virtually used by Gauss more than forty years earlier in 
evaluating symmetric determinants. 

Let us consider the case of the 4th order, 

a i h c i 

^2 ^2 ^2 / 1 \ 
^3 C 3 ^3 

a x b d C d d d 

and let us suppose meanwhile that a x ^ 0. No generality 
is lost, because if a x were zero we could bring some non- 
zero element to leading position by an interchange of rows 
and of columns, altering at most the sign of \A\. Next 
let us multiply all the rows of \A | except the first by a v 
Finally, let us perform the operations 

row 2 — a 2 vow v row 3 — ayrow-^ row 4 — a 4 row x . 

The result is 

a x W c i 

a 3 \A I sss a i C 2 a i^2 

a x a z a x d 3 a x d^ 

a \ a t a x c 4 a x d d 

I ^1^2 1 |^1 C 2 [ |&ld 2 | 

= a x l^ldyj J^lCyj [a^yj . . . (2) 

|^id 4 | KcJ Ja x cZ 4 j 

A rule of condensation is thus derived, which may be 
formulated in the general case as follows : 

Let a x be called the pivotal element or pivot. Then the 
determinant \A\ of order n is equal to 1/a”” 2 multiplied 
by a determinant of order n—l, the elements of which are 
the minors of | A | of order 2 having a x as leading element. 


% h 
0 \a x b 2 \ 
0 My j 
0 \a x b x \ 


°i d x 

I ^1^2 1 I ^1^2 

\ a x c z\ \ a i^Z 

\<h c t\ \ a i c h 


\A\ — |M2 C 3^4| — 
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placed in proper priority of rows and columns. Thus the 
minor belonging to the 1st and (£+l) <A rows, 1st and 
(j~\-l) th columns of \A [ will be the (i, j) th element of the 
condensed determinant. 


1 . 


M 2 c 8 [ = ar 1 


l®AI Kc 2 || 

i«AI Kcsif * 


.(3) 


2. If A is symmetric and the pivot is the leading element 
a u , the condensed determinant is also symmetric. 


The pivot need not be the leading element, which may 
be zero. Any convenient non-zero element a i5 in \A\ 
may be taken as pivot. The condensed determinant then 
has for elements all the minors of order 2 containing a ij} 
each taken as if were its leading element and given sign 
accordingly. In other words, the diagonal of these minors 
which contains the pivot a ij} no matter how it is situated 
in | A\, is counted as the leading diagonal of the minor. 
Then \A\ is evaluated as before, except that the sign 
(___)*+* must be imposed. 


3. 



-10 11 
7 -11 



The pivot is italicized. The reader should work this 
through for himself, checking the result by choosing other 
pivots, as well as by expanding in terms of a row or column. 
He should also do 19, Ex. 2 by pivotal condensation. 

A determinant of order n being reduced by a first 
pivotal condensation to one of order n— 1, the latter in its 
turn can be reduced by a second pivotal condensation to 
one of order n— 2, and so on until finally we have a 
determinant of order 2, a simple cross-product. Naturally 
the case where the pivot is 1 or —1 will give the least 
arithmetical work. It is always possible, by a preliminary 
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division of the row or column containing any non-zero 
element, to turn that element into a unit pivot. This may 
introduce decimal fractions and approximations into the 
computation ; in practical applications, however, in which 
the elements are in any case approximate, rounded off to 
a certain number of digits, no disadvantage ensues from this. 



The pivot is again italicized. The reader should complete 
the evaluation, and should repeat the work with different 
pivots. The value of the determinant is —18. 


Leading pivots are the most advantageous. Next to 
them in convenience come diagonal pivots. 


Some Important Identities. It may be observed 
that pivotal condensation corresponds exactly to the pro- 
cess of successive elimination by which simultaneous linear 
equations are solved. Thus if we have a set of equations 
Ax = h from which we eliminate x j} using the i ih equation 
for this purpose, the matrix of the equations so derived is 
obtained by condensing A with respect to the pivot a if . 
The reader should verify this. 

The method of condensation gives rise to a hierarchy of 
useful identities. We have in the first place 


Msfol = af 1 


Mai !^i^2 ! 
Mai 1 


0 . 



Next we have 

Ma c 3^l = a T 2 


M2 1 

i^AI I ^1^3 1 I ^1^3! 

|a x 6 4 | jaA) 


a i # 0 . 


( 5 ) 


Carrying out the condensation one stage further, with 



EXTENSIONAL IDENTITIES 


49 


leading pivot (assumed non-zero) and using (4) in 

the evaluation of every minor, we derive at once 




— \ a Tp2> | 


1 

| a i&2 C 3 1 

1^1 ^2^3 1 


l a i^2 C 4l 

|^1^2^4| 


( 6 ) 


In exactly the same way, at the second stage of 
condensation of [#i& 2 c 3 ^ 4 e 5 l we arrive at 


I — J& 1&2 


a P 2 C 3 I 

1 


%^2 C 4| 

| ^1^2^ 4 1 


a l^ 2 C s| 

l&iM'sl 

| ^1^2^51 


ITow condensing once again, with leading pivot (assumed 
non-zero) j<z 1 6 2 c 3 [ > and using (6) repeatedly, we obtain 


= KVsI"” 1 


[Gq&gCgC^j |^1^2 C 3 e 4l 

I ^1^2 C 3^ 5 | | 5 1 


( 8 ) 


The derivation of further identities of the kind can be 
continued step by step. Clearing of fractions we obtain 


\a x \ |#i&2 C 3 i 
|%^2l |^1^2 C 3^4 
| tt 1^2 C 3 1 l%^2 C 3^4 e 5 


= I l^lM | a l C 3| 1 9 
= | \ap 2 c z \ ja^^J |, 

^ I \ a p2 C Z^i\ \ a Pl C 2, e f>\ |j 


( 9 ) 


and so on. Here the determinants of the 2nd order on the 
right have been indicated by their diagonal elements, which 
themselves are determinants. (Determinants having deter- 
minant elements are called compound determinants.) The 
identities are now polynomial identities, valid whether the 
determinants and minors concerned vanish or not. They 
are cases of more general identities (45) called extensional 
identities. 


5. Expansion of a skew symmetric determinant of even 
orcler yields a polynomial which is a perfect square. For the 
identity (9) applied to the determinant gives on the left the 
product of the determinant and its leading minor, also skew 
symmetric, of order 71 — 2, and on the right, as is easily seen 
(19, Ex. 9), a skew symmetric determinant of order 2. Hence, 

D 
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since a skew symmetric determinant of order 2 is by inspection 
a square, the original determinant will be a square provided 
that its leading minor of order n—2 is a square. By the same 
reasoning this minor will be a square provided that the 
leading minor of order n — 4 is a square. Proceeding in this 
way, we see that the result depends on the particular case, 
that a skew symmetric minor of order 2 is a square ; and the 
truth of this is evident. 

In this proof we have assumed that the leading minors of 
even order do not vanish. No generality is really lost by this 
assumption. 

The reader should go through the steps of the above 
proof, taking a skew symmetric determinant of the 4th 
or 6th order. 

The square root of a skew symmetric determinant of 
even order is a polynomial in the \n(n— 1) elements, called 
a Pfaffian. Pfaffians have properties in some respects 
resembling those of. determinants, but they lack a simple 
multiplication theorem, such as is derived for determinants 
in 34. 

6. The Pfaffian of order 2 is a 12 a 3 ^ — 

7. Prove that the terms in a Pfaffian are composed of 
factors a ij9 i<j, without repetition of suffixes. By enumera- 
tion of all such terms show that the number of terms in a 
Pfaffian of order n is 1 . 3 . 5 ... (2n — 1). 

Not© on History of Determinants. — The standard 
works of reference, which the reader should consult for 
detailed information, are Muir’s History of the Theory of 
Determinants (Macmillan), in four volumes, and Contribu- 
tions to the History of Determinants , 1900-1920 (Blackie). 

Vandermonde may be regarded as the founder (1771) of 
a notation and calculus of determinants. Of writers prior 
to 1850 Cauchy (1812), Schweins (1825) and Jacobi (1841) 
made specially important contributions in papers of these 
dates. 

Matrices were introduced by Cayley in 1857, 
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THE ADJUGATE AND THE RECIPROCAL MATRIX: 
SOLUTION OF SIMULTANEOUS EQUATIONS : 
RANK AND LINEAR DEPENDENCE 

21. The Adjugate Matrix of a Square Matrix 

Let us substitute for the i th row of a determinant \A\ a 
new row of elements b The cofactors of the in the 
new determinant are evidently the same as those of the 
corresponding <% in \A\, and so an expansion of the new 
determinant is 

* * (!) 

A similar result holds when a new j th column is substituted 
in A. 

Let us consider the case when the b {j are elements of 
any row of A other than the i th . The expansion (1) is 
then the expansion of a determinant with two rows 
identical, and so gives zero. Thus we have the important 
result : 

a h\\^ix\^ a lo^\ J ^i 2 \A‘ ••• ~h a, Jcn\ J 4-in\ 5=5 & 7^ h • (2) 

and a similar result is again true for expansions in terms 
of a column. 

With respect to the element of a given row (or column) 
of A we shall refer to the cofactors of elements in a different 
row (or column) as alien cofactors. Hence by (2) : 

* Expansions in terms of alien cofactors vanish identically . 

The Adjugate Matrix. Let a matrix M be con- 
structed, the elements of which are the cofactors \A tj \ 
61 
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of elements a i{ of A, but placed in transposed position,. 
Thus 

M = [\Av\Y = [M„|]. . . . (3) 

This matrix is called the adjugate or adjoint of A, and will 
be noted by adj A. Its determinant [adj A | will be 
called the adjugate determinant of A } or the adjugate of 

Ml- 

(In most books on determinants an untransposed 
adjugate is used, but for consistency and unification of 
theory it is preferable to transpose.) 


1. 

r% h °i] 

|&2 c 3| — |£>iC 3 | |&iC 2 | 

HA = 

a z b 2 c 2 * adj A = 

l a 2 C 3| |®l c 8l |°l c 2l 


a 3 ^3 C 3J 

. k&a H®AI I®A1. 


2. 

'1 

2 

3" 


' li 

-9 

1 ‘ 

If A = 

1 

3 

5 

, adj A = 

-7 

9 

—2 


1 

5 

12 


2 

—3 

1 


3. Evaluate \A\ for the matrix of Ex. 2, Also form the 
product matrices A (adj A) and (adj A) A. Try some other 
examples of the 3rd order. 

Forming the product .4 (adj A), and keeping before us 
the case n = 3 for illustration, 


$11 $12 $13 


Mil! M21I Mail 


$21 $22 $23 


M12I M22I M32I 

= 

_^$31 $32 $33 


_Ml3| M23I 1^4-33 1 




>( 4 ) 


we observe that by (2) all non-diagonal elements in the 
product vanish, being expansions in terms of alien cofactors, 
while the diagonal elements are expansions of j A | in terms 
of elements of a row and true cofactors. The reasoning is 
general for any order, and so we have 

A (adj A) = \A\L . . . (5) 

In the same way let us form (adj A) A. The reader 
will assure himself that the diagonal elements are \A\, this 
time by expansion according to elements of a column 
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and cofactors, and that all non-diagonal elements vanish 
as before. Hence 


A (adj A) = (adj A)A = \A\I. . . (6) 

Singular and Nonsingular Matrices. — A square 
matrix A for which the determinant \A | = 0 is said to be 
singular. If \A | ^ D, then A is said to be nonsingular. 

By (6) if A is singular, A (adj A) = (adj A) A =0. 
On the other hand if A is nonsingular we may divide the 
identity (6) through by \A\ 9 obtaining AB = BA = /, 
where 

R = |-4 1 _1 adj A = [\A h \I \A\] = [|4 W | / \A\]'. (7) 

The matrix B thus disclosed may appropriately be 
named the reciprocal matrix of A , and may be denoted 
without ambiguity by A~\ It may be noted that it is 
both a pre-reciprocal and a post-reciprocal, a circumstance 
which depends, ultimately, on the fact that \A'\ = \A\. We 
note also that it exists only when A is square and non- 
singular. Singular matrices have an adjugate, but no 
reciprocal. 

At this stage the long-deferred operation of division for 
matrices may be introduced. If A is nonsingular we may 
operate with it in predivision , as in A~ X B, or in post-, 
division, as in BA and just as in the case of multiplication 
these operations will in general produce different results. 

4. If A is nonsingular and AB = BA, then A~ J B = BA- 1 . 

5. Evaluate A- % B and AB where 


A = 


1 2 
1 3 
1 5 


3 ] [1 

5. , B = 1 
12 Ll 


1 1 
2 3 
4 9 


6. Given only that the pre- and post -reciprocals of a 
matrix A exist, we may prove as follows that they are unique, 
and moreover identical. For suppose B and S ai’e two pre- 
reciprocals, and T a post -reciprocal. Then we shall have 
(R-S)AT = (R-S)I = B-S, while (R-S)AT — T —T — 0. 
Hence R = S. In the same way the post -reciprocal T is 
unique. 
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Again, from RA = I we have (AR — I+R)A = /. But 
since R is unique, AR—I+R = R. Hence AR = /, so that 
is also the unique post-reciprocal of A. 

7. The reciprocal of a nonsingular diagonal matrix is a 
diagonal matrix. For example : 



, A fxv 0* 


S. Let A be a matrix obtained by subjecting the rows of I 
to a certain permutation of order, and let B be the matrix 
obtained by subjecting the columns of I to the conjugate 
permutation. Prove that B and A are reciprocals. For 
example, corresponding to the conjugate permutations (2431) 
and (4132), the matrices below are reciprocal : 



— 

— 



— 

— 


. 

1 . . 



. 

. 1 

B == 

• 

. . 1 

. 1 . 

,K = 

1 * 

I . 


1 

. . . 



. 1 


Note that H' = K, H* 1 «= 

H 

/ 





1 A [z 


‘1 - 

3. 

1 

1 


9. The matrices 

. 1 . 

j 


1 




_• * 1. 


m *■ 

1 



are reciprocal. They each represent (19, Ex. 23) elementary 
operations. 

Principal Minors. In a matrix A those minors 
which have elements situated symmetrically with respect 
to the principal diagonal of A are called principal minors. 
For example the diagonal elements a u are principal 
minors of order 1, their cofactors \A U \ in a determinant 
A are principal minors of order n— 1, and \A\ itself may 
be regarded as the one principal minor of order n. 

The principal minors of a Hermitian matrix are 
Hermitiari. All principal minors in I are equal to 1, and 
all minors unsymmetrically placed are equal to 
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22. Solution of Linear Equations in 
Nonsingular Case 

The discovery of A~ x enables us to give the general 
solution of n linear equations in n unknowns when the 
matrix A of the equations is nonsingular. In fact, if the 
equations in matrix notation are 

Ax = h , where \A\ ^ 0, . . (1) 

we have at once, premultiplying by A" 1 , 

x = A" 1 h, . . . . (2) 

which gives the vector of solutions. Further, since A~ x 
is unique, these are the only solutions. 

To see how the set of solutions appears in ordinary 
notation and in terms of determinants and cofactors, let 
us keep again before us the case n = 3. 


*1 


Mill M*2l| Msil 

pi' 1 


*a 

- Ml " 1 

M 12 1 M 22 I M 32 I 

h 

• (3) 

_*3_ 


1^-13 1 M 23 I 1^-33 i 

^3 



Carrying out the multiplication on the right, and 
observing that every element of the vector obtained is 
bv 21 (1) the expansion of a determinant according to 
elements h { of a substituted column, we obtain 


^1 a l 3 

JIq <7/ 3 2 <^33 


^ 11 ^13 

C&gx ^2 ^23 

a 31 h z « 33 

II 

a ll ®12 
®21 ^22 ^2 
&31 ^32 ^3 


3 “ 



a il a l 2 a Vl 


a il a l 2 U 13 


^11 a i2 a i 3 

& 2X ®22 ^23 


®21 ®22 ^23 


^21 ^22 ^23 

a 31 a Z2 a 33 


^32 a 23 


^31 ^'32 ^33 


This result, which can clearly he extended to the general 
case, gives the rule of Cramer (1750), namely : each x 5 
is a quotient, the denominat or of which is the determinant 
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\A\ of the system of equations, the numerator being a 
determinant obtained from [ A | by substituting the column 
vector { h x \ ... h n } for the j th column of \A |. 

We may note at this point three equivalent methods of 
solving in practice a set of simultaneous equations. The 
first is the method of successive elimination followed by 
resubstitution ; the second is by evaluating the reciprocal 
matrix A -1 and operating with it upon the vector of 
constants on the right ; the third is by Cramer’s rule, to 
evaluate the n+1 determinants of (4) and to form the n 
quotients. It will depend entirely on circumstances and 
on the means of computation available which method is 
used in a particular case. 

1. Solve the equations 

3+ y+ z = 6, 

x+2y+3z = 14, 

3+4y+9 z = 36 
by the various methods. 

The reader should also construct various equations for 
himself, substitute numerical values of x, y, z to find the 
right-hand constants, and then solve, both by Cramer’s 
rule and by forming the reciprocal matrix. The latter 
is a very simple procedure for a matrix of the 3rd order. 

2. Premultiply by A' the set of equations Ax = h. The 
result is the set A' Ax = A'h, a set of equations with symmetric 
matrix A' A. Treat the set of equations of Ex. 1 in this 
way, and solve. 

3. Practise finding the adjugate and reciprocal of various 
matrices of order 3x3 invented for the purpose. Notice that 
when the adjugate is found the determinant \A\ can be 
found in six different ways, by evaluating the diagonal 
elements of -4(adj A) and of (adj A) A. To use at least two 
such evaluations is to provide a useful check. 



RECIPROCAL OF PRODUCT 


57 


23. Reversal Rule for the Reciprocal of a 
Product Matrix 

Reciprocal of Product Matrix. The reciprocal of a 
product matrix is the product of the reciprocals of the 
separate matrices in reversed order. For we have 

AB . B-i-A- 1 = A . 7 . A- 1 = AA- 1 = 7, 
B-*A-*.AB = B-*.I.B = B-'B = I. 

Hence 

(ABC)-* = (AB . O')-* = C-HJB)- 1 = (1) 

and in the same way ( ABCD )“ x = D- 1 C , - 1 jB~ 1 A“ 1 , and so on. 
All matrices concerned are of course nonsingular. 

It is also easy to prove that the ad jugate of a product 
matrix is the product of the separate adjugates in reversed 
order. Thus 

adj (ABC) = adj C . adj jB . adj A, . . (2) 

This is only to be expected, since the adjugate has been 
defined in the first place as a transposed matrix, and so the 
reversal law for the transpose of a product (9) is operative. 

Reciprocal Transformation. If Ax = y, \A\ ^ 0, 
then x = A~ x y. Each of the linear transformations 
indicated by these equations is said to be the reciprocal 
of the other. 

24. Orthogonal and Unitary Matrices 

A square matrix A such that A f A — A A' — 7, that is, 
A' = A" 1 , is said to be orthogonal , for the following reason. 
Consider the distance from the origin to any point in 
n-space in rectangular Cartesian coordinates. The origin 
being denoted by the column vector (0 0 ... 0} and the 
point by x = {x ± x 2 ... x n }> the square of the distance is 
x i-\~ x 2 +•••+#« = x’x. This is unchanged by any 
rotation of axes about the fixed origin, provided that the 
axes remain rectangular, that is, orthogonal. Now the 
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rotation is expressed by a linear transformation A, and its 
effect upon x is to produce the new coordinates Ax. The 
square of the distance from the origin is now x' A' Ax. 
But this is unchanged for all points x in the space. Hence 
we have x'A'Ax = x'x identically in x , in other words the 
quadratic form x'(A'A— I)x is identically zero. Hence 
A' A— I = 0, so that A' A = I. 

Unitary Matrices. The extension of the concept of 
orthogonality to matrices with complex elements is effected 
by defining a unitary matrix A by the condition that 
A~~ x = A', or A' A = A A' = I. A real unitary matrix is 
evidently an orthogonal matrix, so that all the properties 
of orthogonal matrices will be contained as special cases of 
properties of unitary matrices. 


1. Prove that A'A — A A' = J, where 


A 


cos# 

—sin# 


sin#! 

cos#J‘ 


This orthogonal matrix represents the change of rectangular 
axes in two dimensions by rotation through an angle 9 . 

2. Prove that the product of any number of unitary 
(and so of orthogonal) matrices is a unitary (or orthogonal) 
matrix. 

Por orthogonal matrices the geometrical interpretation is 
that a succession of rotations about a fixed origin has the 
same resultant effect as a single rotation. 

3. Prove that the adjugate and the reciprocal of a 
Hermitian matrix are Hermitian. 

4. Prove that the product of two Hermitian matrices is 
not in general Hermitian. Write down a few numerical 
symmetric matrices of the 3rd order, form their products in 
pairs and examine the results. 

5. Prove that the matrix 


i/Vs <i+*)/W 

(1 — i)/V3 -l/\/3. 


is unitary, where i = V — jL 
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6. Let L be the matrix of order 3x3 with rows 

Pi Wh. n ll P 2 ^2 ^ 2 ]* P 3 ^3 ^z\i 

these being respectively the direction cosines of the new 
rectangular axes produced by a rotation in 3 dimensions of 
rectangular axes with fixed origin. Then L is orthogonal. 

Write down in full the 12 relations given by the ©qualities 
LL' = I and L'L = I, and observe that these are the 
“ relations between direction cosines of three mutually per- 
pendicular straight lines,” as given in treatises on coordinate 
geometry of 3 dimensions. 

In n dimensions there are n(n- j-1) corresponding relations, 
all included in the matrix equality LL' = L'L = /. 

7. Let the coordinates of n arbitrary points in 
n- dimensional Cartesian space be taken as the columns 
x {j of a matrix X . Then if A is orthogonal we have 
X'A'AX = X'X, since A' A = I. The diagonal elements 
x' (i x (i of X'X are the squares of distances from the origin 
and are preserved invariant under the rotation A. But in 
virtue of X'A'AX = X'X the non -diagonal elements x' (i x u 
are also preserved invariant. Their geometrical interpretation 
is the product of the distances of the points x ({ and from 
the origin, multiplied by the cosine of the angle between the 
lines joining those points to the origin ; in other words, the 
product of either line and its projection on the other. In 
two dimensions, for example, we have 

x {1 — cos# r^sintf}, x (2 = {r 2 oos(f> r 2 sm<j>} 
and the relation in question is 

[r x cos0 rjsin0]{r 2 cos<£ r 2 sin</>} = r^cos 

25. The Solution of Homogeneous Linear 
Equations 

In the equations Ax — h all the elements of Ax are of 
the first degree in the elements of x, but the constants 
on the right, the elements of h , are of zero degree in the 
elements of x. For this reason the equations Ax = h 
are described as non-homogeneous . On the other hand, the 
equations Ax = 0 are homogeneous. We now inquire what 
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conditions must be satisfied by A in order that the equations 
Ax = 0 may have a non-trivial solution x ^ 0 ; for we 
reject as trivial the obvious solution x = 0. It is clear that 
the vector x , if it exists, will be arbitrary to the extent at 
least of a scalar constant factor, for if Ax = 0 then certainly 
Ay = 0, where y = Xx. For this investigation, as well as 
for the later discussion of m non-homogeneous equations in 
n unknowns, we require the important concepts of rank 
and linear dependence. 

26. Rank and Nullity of a Matrix 

Rank. If A is a matrix, rectangular or square, and if 
all minors of order r-j-1 contained in it are zero while at 
least one minor of order r is not zero, then A is said to be 
of rank r. 

Nullity. If A is square of order nxn, then n—r, the 
complement of the rank with respect to the order, is called 
the nullity of A. If A is rectangular of order mxn there are 
two nullities, a row-nuHiiy m—r and a column-nullity n—r. 

1. A row or column vector with at least one nonzero 
element is of rank 1. 

2. The unit matrix of order nXn has rank n and nullity 0. 

3. The null matrix of order mxn has rank 0, row-nullity 
m, column -nullity n. 

4. The matrix of order mxn m. which ovory element is 
unity has rank 1, row -nullity m— 1, column -nullity n — 1. 

5. If u and x are nonzero row and column vectors of 
orders 1 xn and nXl, show that the scalar ux and the matrix 
xu are of rank 1 or 0. Examine some numerical examples. 

6. The matrix U of order nxn has units (11, Ex. 5) in 
the first superdiagonal and zeros elsewhere. Show that the 
rank of U k is n— k, k**Zn. 

Elementary operations on rows or columns of A , 
involving permutations of order or linear combinations, 
the modified row or column receiving a unit or at least 
nonzero coefficient in any such operation, leave the rank 
of A invariant. For under any permutation of rows or 
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columns the minors of A will receive at most a change of 
sign, but not of numerical value ; and under an operation 

row i +A 1 row 1 +A 2 row 2 +...+A*row ft . . (1) 

any minor \A i | containing elements of row*- will become 

|^i|iA 1 |^ 1 [dzA2|^.2|rb-**=fcA*|-4 ft ], . . (2) 

where \A t |, \A 2 \ , ..., \A k \ are minors obtained from \A { \ 
by substituting in row* the corresponding elements of row x , 
row 2 , ..., row*. respectively. This follows at once from (1) 
by expanding the modified minor in terms of row*. Similar 
remarks apply to operations of linear combination on 
columns. It follows that if all minors of order r+1 in A 
are zero, then all minors of order r+1 in PAQ , where P 
and Q are the nonsingular matrices (19, Ex. 19, 20, 21, 22, 
23) effecting these elementary operations, are also zero. 
In other words, if B = PAQ then B cannot have higher rank 
than A. But P' 1 and Q~ l are also (21, Ex. 9) elementary 
operations of the same kind ; and so since A = P^BQ" 1 , 
A cannot have higher rank than B. 

Hence A and B must have the same rank r. 

27. Linear Dependence of Functions, Vectors 
and Matrices 

Linear Dependence. If a set of functions f l3 f 2 , . . f n 
or of vectors / (1 , / <2 , / (n or of matrices F x> F 2 , ..., F n 

satisfies identically a linear relation 

c l/l+ C 2/2 + -*-+ C n/» = 0 • * • (1) 

or the like with f u or F j for where the scalar constants 
0j are not all zero, then the functions or vectors or matrices 
are said to be linearly dependent. More precisely any / 
affected by a nonzero coefficient c* in (1) is said to be 
linearly dependent on the remaining / J s. If no relation 
such as (1) is possible the f j are said to be linearly 
independent. 
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1. The linear dependence of the functions in (1) can be 
expressed concisely in the form c'f =0, o f ^ 0, where c' is 
the row vector of scalars c j9 f the column vector of functions. 

2. The linear dependence of vectors may be expressed in 
matrix notation. For example, if a a , a (2 , a (s are column vectors, 
linearly dependent in virtue of a relation 

®i a (i+®2®(2+*s®<» = 0 . . . (2) 

this relation is exactly the same as 

Ax = [a (1 a (2 a (3 ] {x x x 2 x z ) = 0, . (3) 

where A is a partitioned matrix of which the vectors a u are 
columns, and x # 0. In the same way, to assert that certain 
row vectors a' {i are linearly dependent i8 to assert that the 
homogeneous equations uA — 0, where the row vectors of H 
are the rows of A f have a non-trivial solution u ^ 0. 

3. If a matrix A is partitioned by rows into {A t A z ), and 
every row of the submatrix ^L 2 is linearly dependent on the 
rows of A lf then A 2 = OA lt where the elements in the 
successive rows of C are the respective coefficients in the 
successive relations of dependence. (The reader should write 
out in full some examples of low order. ) 

4. If the rows of a square matrix A are linearly dependent 
then | A J = 0 ; for there exists an operation on rows, namely, 
that given by the relation of dependence, which will replace 
all the elements of some row by zeros without altering the 
value of \A\. Naturally the coefficient of this particular row 
in the relation of dependence must be unity. 

For example, if the rows of a matrix A of order 3x3 
satisfy the relation roWi— 2 row 2 — 3 row 3 = 0, tlien the 
operation row!— 2 row 2 — 3 row 3 ajjplied to \A\ rodxxces all 
elements of the first row to zero ; hence \A\ = 0. 

5. For the same reason if the columns of A are linearly 
dependent then \A\ — 0. 

Let us insist that we are not yet entitled to assert the 
converse theorems, namely, that if \A \ — 0 the rows and 
the columns of A are linearly dependent. 

6. If every set of r rows of a rectangular matrix A is linearly 
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dependent then A is of rank r — 1 or less, since by Ex. 4 all 
minors of order r in A must vanish. Similarly for columns. 


28. Conditions for Solution of Homogeneous 
Equations 

Let the equations be in n unknowns x t , and m in 
number. In fact we consider Ax = 0, where A is of 
order mxn and of rank r. 

Necessary Condition. The necessary condition for 
Ax = 0, x 0 is that r shall be less than n, the number of 
unknowns. 

Eor suppose r = n, which can happen only if 
Then by a rearrangement, if necessary, of the equations, 
that is, of the rows of A, we can make the submatrix A x 
constituted by the first n rows nonsingular. Partitioning it 
from the submatrix B 1 constituted by the remaining m—n 
rows, we have the equations in partitioned form 



so that A x x = 0, B x x = 0. 


(i) 


Now since A x 
leads to 


is nonsingular A 1 - 1 exists ; and so A x x = 0 
Al 2 A x x — — x 0, 


contrary to the hypothesis that x ^ 0. Hence the 
condition r<n is necessary. 

Sufficient Condition. The same condition is also 
s uffi cient. Tor suppose r<n> and let the equations and 
also the unknowns be arranged, as is always possible, so 
that the leading submatrix A 1 of order rxr is nonsingular. 
The leading submatrix of order (r+1) X (r+l) will of course 
be singular. Let the cofactors of elements of the last row, 
the (r+l)^, of this singular submatrix be taken as the 
first r+1 elements of a column vector x, and let the last 
w—r— 1 elements of x be zeros. Then x ^ 0, since its 
( r _j_l)i7i element is \A X \ i=- 0. We shall proceed to prove 
that x , so constructed, satisfies Ax — 0. 
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It may be helpful to the reader to have an illustration 
under view, in which m = 4, n = 5, r = 2. 


Ax == 


% h c i 
@2 ^2 ^2 
&3 ^3 

U 4 &4 C 4 




= 0. (2) 


All elements in are zero, either because they are 
expansions by alien cofactors (21, (2)) or because they are 
expansions of minors of order r+1 taken from the first r 
rows and some later row of A ; and all such minors vanish. 
(The reader should verify this for every element of the 
product in the illustration.) Hence a solution x ^ 0 has 
been constructed ; and so the condition r<n is sufficient. 

1. If A is of order nXn and \A\ = 0, then r must be 
<n. Hence Ax = 0, x 0 is possible. In other words 
(27, Ex. 2) the columns of A are linearly dependent. In the 
same way, since \A'\ = \A\ = 0, the rows of A are linearly 
dependent. 

The determinant \A\ is often called the eliminant of the 
equations Ax — 0. 


Thus we have the important converse of 27, Ex. 4, 5, 
namely : the vanishing of a determinant both implies, 
and is implied by, the linear dependence of its rows and 
of its columns. 

This converse theorem enables us to discuss rank by 
means of linear dependence. For if B is a rectangular 
matrix of rank r , it contains a non- vanishing minor \A\ 
of order r. Hence the r rows (or columns) of B which 
contain \A\ are linearly independent. On the other hand, 
any r+1 rows (or columns) of B are linearly dependent, 
since otherwise they would contain some non-vanishing 
minor of order r+1, contrary to hypothesis. The important 
conclusion is that the statements (i) B is of rank r, (ii) the 
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maximum number of linearly independent rows or columns 
in B is r, are equivalent. 

2. A symmetric matrix of rank r contains at least one 
non-vanishing principal minor of order r. For since the 
matrix is of rank r , a certain r rows must be linearly in- 
dependent. Hence elementary operations exist which annul 
the remaining rows and, by symmetry, the corresponding 
columns. Hence the minor contained by the r rows and 
columns is nonzero ; and it is evidently a principal minor. 

The extension to the case of a Hermitian matrix is evident. 

3. If m>n , any set of m vectors of n elements is linearly 
dependent. For suppose them to be column vectors of a 
matrix, and annex m—n rows of zero elements to make a 
larger matrix A of order mxm. Then | A | = 0. Hence the 
columns of A are linearly dependent ; and so the original 
column vectors (namely, the columns of A > each stripped of 
its last m—n zeros) are linearly dependent. 

4. If x l9 x 29 ..., % n are scalar numbers all different , the n 
vectors [1 x. ... xj- 1 ] are linearly independent. 

5. Let A be of order n x (n+ 1) and of rank n, the leading 
submatrix A x of order nxn being nonsingular. Then the 
(n-f 1)* A column of A is either itself null, or is linearly dependent 
on the preceding n columns, as follows : 

Let A be written in partitioned form [A x a], where a is the 
last column, and consider 

[A x a] £ j = 0, that is, A x x — a . 


Since \A X \ 96 0 the equations A x x = a have a unique 
solution x — A~ l a . The relation A x x = a is then (27 (3)) 
the desired relation of dependence of columns. The reader 
should try some numerical examples, such as 


A = 


T 

1 

1 


2 1 

3 3 

4 6 



6. It follows from Ex. 5 that any nonzero column vector 
of n elements can be expressed as a linear combination of the 
columns of a nonsingular matrix A of order nxn ; with a 
corresponding result for any nonzero row vector. 


E 
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7. The vectors of the nonsingular matrix are said to 
constitute a basis , or vector basis, in terms of which an arbitrary 
vector of the same order can be linearly expressed. The 
simplest basis is provided by the rows 

e (1 = [l 0 0 ... 0], e (2 =[0 1 0 ... 0], e (n =[0 0 ... 0 1] 

of the unit matrix I 9 and in the same way the columns of J. 
For example 

u = [u x u 2 u$] = tqfl 0 0]~fw 2 [0 1 0]d-w 3 [0 0 1]. . (3) 

The solution which we constructed at (2) for Ax = 0, 
r <n is in general not unique, since any column later 
than the r th could have been moved into the (r-f-1)* 71 
position, thus producing in general a different leading 
submatrix of order (r+1) x (V+l) and therefore different 
cofactors as the elements of x ; for example, elements d t 
or e f instead of c* in the solving vector of (2). The solution 
is therefore arbitrary to an extent which now calls for 
consideration. 

29. Reduction of a Matrix to Equivalent Form 

Let A be a matrix of order mxn and of rank r. By 
elementary operations, rearrangements of rows and 
columns if necessary, we may bring a nonsingular sub- 
matrix A x of order rxr into leading position. The 
operations being equivalent to nonsingular pre- and 
postmultiplications (19, Ex. 19 to 23) which do not 
alter rank (26), we have 



where P and Q are nonsingular matrices of order mxm 
and n X n, and PAQ is of rank r . Since A x has a reciprocal 
Ai 1 the following further reductions are possible, as the 
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reader should verify by carrying out the partitioned 
multiplications indicated : 

I . A 1 B x ]__ \A t B x /o\ 

-Mr 1 l\iA 2 B 2 \-[ . B 2 -A*A-'B J’ W 

UbA [/ -a-^bAJa, 

1/^2 -^2J / J L-^2 ^2 A 2 A^ ^B x 

r i .] \a x b x i \i -a^b; 

L -Mr 1 i\ Ui L- i J 

=== 

-®2 “^2^1 1 -^1 

Now B 2 ~~A 2 A~ 1 B 1 must be null, because if it contained 
any nonzero element % then the submatrix, on the right 
of (2) or (3) or (4), consisting of *A 1 bordered by the row 
and column containing c i5 would be nonsingular, having 
a determinant c^A^ ^ 0. But then the rank would be 
r+1 or more, infringing the theorem that the rank of A 
is invariant under the elementary operations (19, Ex. 23) 
which have been used. 

Hence we have the important results, that if A is of 
rank r it can be reduced by equivalent nonsingular 
transformations PA, AQ, PAQ to respective forms 

[j* ;], [ Al ;], Mil 9*0, . ( 5 ) 

where \A X is of order r. 

We may call the first two of these semi-reduced forms. 
By removing a further nonsingular factor 

[ At i] ■ ■ ■ ■ < e > 

and absorbing it into either P or Q we have the still simpler 
equivalent reductions 



where the leading submatrix 1 is of order rxr. 
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It is also to be noted that the elements of the matrices 
P and Q which effect these reductions PAQ are all rational 
functions of the elements of A. This follows from 
inspection of (2), (3), (4) and (6). 

1. If A is symmetric it contains (28, Ex. 2) a non -vanishing 
principal minor. By a rearrangement of rows, and exactly 
the same rearrangement of columns, this minor may be brought 
into leading position. The transformation effecting this is of 
congruent type P'AP. The transformation of 29 (4) may 
then he applied ; but we may observe that because of the 
symmetry of A we must have A\= A v B 1 = A'^, and so 
(A~ l B^)' — A^A^ 1 . It follows that in this case the trans- 
formation (4) is again congruent, and so for symmetric matrices 
we derive the fully reduced form in (5) by a congruent trans- 
formation of type P'AP. 

_ 2. The reader should go through the same steps for a 
Hermitian matrix A, deriving the fully reduced form (5) 
by a conjunctive transformation of type P'AP. 

The first semi-reduction in (5) enables us to discuss 
the extent of the arbitrariness in the solution of a system 
of homogeneous equations. We may suppose the equations 
and the unknowns rearranged so that the leading submatrix 
A x of the system, is of order rxr and nonsingular. Applying 
the semi-reduction, we may write the equations Ax = 0 in 
the partitioned shape t 



where x has been partitioned into its first r and last n—r 
elements. Hence 

P* 1 B ) ^ = o > or A i x a = (9) 

so that x a = . . . (10) 

This form of the solution shows that the n—r elements 
of x (z can be assigned quite arbitrarily, and that when this 
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is done the r elements of x {1 are uniquely determined. 
Hence the important result : 

If A is of rank r less than n , the number of unknowns, 
and if the columns of A corresponding to a certain r 
unknowns constitute a matrix of rank r, then the remaining 
n—r unknowns can be assigned arbitrarily, and may be 
regarded as parameters in terms of which the other r 
unknowns can be linearly and uniquely expressed. 

3. If x 1 + 2 % 2 + 3 % 3+4#4 = 0 

x 1 +3x 2 +6x d +7x i = 0, 

express x 1 and x 2 in terms of x B and x 4 ; also express x x and 
x a in terms of x 2 and x 4 . 


We further observe (28, Ex. 6) that the number of 
linearly - independent vectors x i2 cannot exceed n — r. 
Hence from the form of the solutions of Ax = 0, namely 


x = 


I" —A x 1 5 1 o; (2 
L ^(2 J’ 


. ( 11 ) 


it is equally clear that the number of linearly independent 
solutions x also cannot exceed n—r. 


30. Consistency and Solution of Non- 
Homogeneous Equations 

The non-homogeneous equations Ax = h can he treated 
in a similar manner by the semi-reduction of 29 (5), which 
gives 

[ A Bi ] fed = p fed - fed- ■ • (1) 

let us say. Carrying out the multiplication on the left 
we observe that k (2 must be zero. If k i2 ^ 0 the equa- 
tions Ax = h cannot be solved ; they are then said to be 
incompatible or inconsistent. 
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The condition for consistency, 7q 2 = 0, can be expressed 
in a more direct way. If we compare the partitioned 
matrices 


we may note that if & (2 = 0 they are of the same rank r, 
whereas if k i2 ^ 0 the second matrix is of rank r+1, since 
at least one submatrix of .order (r+1) X (r-j-1), containing 
A x and some nonzero element of & (2 , is nonsingular, 
its determinant being ki\A x \ # 0. Hence for compatibility 
of the equations Ax = h the two matrices in (2) must have 
the same rank r. Premultiplying by P -1 , which does not 
alter the rank, we have the condition that A and [A ft] must 
have the same rank r. It is customary to call [A h] } con- 
structed as it is by adjoining to A the column of constants 
on the right of the equations, the augmented matrix of the 
system. Hence the criterion : 

The necessary condition for the consistency of a set of 
non-homogeneous linear 'equations is that both the matrix 
of the system and the augmented matrix should have the 
same rank. 

1. Investigate the consistency of the equations 

#1+ “ 4, 

*i+2a? a = 7, 

2^i —J— 5 Xt) = 3 15 

by the above test of rank, and also from first principles. 

If the condition of consistency is satisfied we have 

a x b; 

so that A x x ix +B : 

yielding x ix = —A 


iy<2j 


L lt '(2 ' 


L (l> 


■1 l k(i' 


( 3 ) 

(4) 

(5) 


A i 


and 


A x B x k { 
.. . k { 


• ( 2 ) 
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Here again, just as in the case of homogeneous equations, 
the values of the n—r unknowns in x i2 can be assigned 
arbitrarily, and when this is done the r elements of x a are 
uniquely determined. Hence we have another important 
result : 

If a set of non-homogeneous equations in n unknowns is 
consistent, the rank of the matrix and of the augmented 
matrix being r, then the values of n~r of the unknowns 
may be assigned arbitrarily, provided that the set of 
columns corresponding to the remaining r unknowns 
forms a submatrix of rank r. 

2. Test for consistency and solve : 

& 1 + ^ 2 + ^3 = 6 , 

d” “H 3^3 = 14, 

£B 1 +4a? 8 +7<&3 = 30. 

Criteria for Rank. The following examples embody 
results which serve as criteria for the rank of a 
matrix. 

3. If all the rows (or columns) of a matrix A are linearly 
dependent on a certain r rows (or columns) which are them- 
selves linearly independent, then A is of rank r. 

Partition A into {A x A 2 }, where A t contains the linearly 
independent r rows, A 2 the remainder. Then (27, Ex. 3) we 
must have A % = CA X . Accordingly 

u dia-M. 

that is, an elementary nonsingular operation has reduced 
A to a matrix winch is evidently of>rank r. Hence (26) A 
is of rank r. 

4. If a certain minor l^] of order^r in A is nonzero, while 
all minors of order r+1 containing \A X \ are zero, then A is 
of rank r. 

Without loss of generality, wo may take | A x \ as leading 
minor. Then (28, Ex. 1) the first r rows of A are linearly 
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independent. Let any other row of A be adjoined as (r+l)* A 
row. We shall prove that it is linearly dependent on the 
first r rows. 

For since by hypothesis the leading minor of order r-f 1 
is zero, its (r-fl) tA row must be linearly dependent on its 
first r rows by a relation 

V- 

row T+3s +c 1 row 1 +C 2 row 2 + +c r row f = 0. . (6) 

If we substitute for the (r+1)** column of this leading minor 
the corresponding elements of some lator column, we obtain 
another minor of order r+1 which contains |ri x | and so 
vanishes. Evaluating this minor by applying the operation 
on the left of (6) we reduce the first r elements of its (r-f-1)** 
row to zero; but the (r-f-1)** element must also be zero, 
because if it were 0 then the value of the minor would 
be c|A x | =£ 0, contrary to hypothesis. (Cf. 29 (4).) 

It follows that the operation on the left of (6) reduces all 
elements of the {r+l)** row of A to zero ; in other words, 
the (r + 1)* A row of A is linearly dependent on the first r 
rows, (6) being the relation of dependence. 

Hence every row of A is linearly dependent on the first 
r rows, which are themselves linearly independent. Hence, 
by Ex. 3, A is of rank r. 

The practical consequence is that rank may foe ascertained 
by pivotal condensation, for by the condensation process 
( 20 ) applied to A each pivot is contained in its successors, 
and the pivots involve minors of A of successively higher 
orders. It follows from this and from Ex. 4 that if wo regard 
the setting up of A as stage 1, the first condensation with 
respect to a pivot in A as stage 2, the second condensation 
as stage 3, and so on, then the last stage possessing non- 
vanisliing elements is stage r. This is perhaps tho most 
expeditious way of ascertaining the rank of a matrix. 

5. If A is symmetric and possesses a non-vanishing 
principal minor \A X \ of order r, such that all principal minors 
of orders r + 1 and r+2 containing \A X \ vanish, then A is of 
rank r. 

Without disturbing symmetry we may take \A X \ as leading 
minor, and we may bring any two principal minors of orders 
r+1 and r + 2 containing \A X | into leading position also. 
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Lot us exemplify the three minors by |« 1 6 2 c 3 [ 3 [aj& a c 3 e? 4 [, 
|%^2 c 3^4 e sl* By 20, (8) 


I < ^1^2 C 3^'4 e fil | a l^2 C 3l — 


|a 1 &2^3 C ^4| | <2 '1^2 C 3 e 4| 
| a 1^2 C 3^5| 1^1^2^3®s| 


that is. 


0 — ! a 1^2 C 3 6 4| 2 > 


since \aib 2 c z d^e^\ — 0, \<*Jb%c z d 4 \ = 0 and by symmetry 


|a 1 6 2 c 8 ^5| ” l a i^2 c s e 4|* Hence [^x^25 c a e 4|> a non-diagonal minor 
of order r - {-1 containing [AjJ, vanishes ; and so it follows that 
all such non-diagonal minors, as well as the principal ones, 
vanish. Hence, by Ex. 4 , A is of rank r. 

The reader will 4 e© that the same theorem holds for 
Hermitian matrices. 


* Latent Vectors, Latent Roots. The vectors artd 
scalars introduced below s£re fundamental in matrix theory. 

6. If A is of order nxn ,* row vectors u and column vectors 
q exist such that uA*== Xu, Aq — Xq^ For u(A — \I) = 0 
and (A—XI)q — 0, if \A — AJj = 0 : . This is the characteristic 
equation of A, of degree n in A. Its n rr^ots A^ are the latent 
roots of A ; and SO to each A^ corresponds a pair of latent 
vectors u u , q^, determined but for a scalar factor. 

7 . Prove that |£-AI| = |A-AI|, where B = KAKr\ 
Thus B and A have the same characteristic equation and 
latent roots. The matrix K is nonsingular but arbitrary. 

8. If A is Hermitian Aq — A q gives q'A — A q\ Thus 
each q' is a latent row vector u. 

9 . The latent roots of a Hermitian matrix are real. For 
Aq = gives q'Aq = A q'q % The Hermitian forms (10, Ex. 13 ) 
q'Aq, q'q are real, and q'q is not zero. Hence A is real. 

1 0 . The latent roots of a unitary matrix have unit modulus. 
For Aq = A q gives q'A'Aq = AA q'q* But q'A'Aq = q'q, since 
A' A = 7 . Cancelling the nonzero q'q we obtain AA = 1 . 

The reader should find the latent roots in 24 , Ex. 1, 5 . 

11. Establish step by step A*q = A 2 g, A k q — X k q. 
Also A^q = X~ x q for nonsingular A. Deduce that if i/j{ A) 
is a rational function, i(/(A) nonsingular, then i/j(A)q = i/r(A)g. 

* In a first reading tills section inay be omitted. 
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FURTHER EXPANSIONS : CAUCHY AND LAPLACE 
EXPANSIONS : MULTIPLICATION THEOREMS 

31. Expansion of Determinant by Elements 
of a Row and a Column 

The expansion of a determinant \A | according to elements 
of a row and their cofactors 3 or elements of a column and 
their cofactors, is only one of many possible expansions. 
Consider for example the evident identity 


a lX 

a 12 

a !3 

— °nMnl+ 

0 

a 12 

a i3 

a 21 

a 22 

a 23 

a 21 

a 22 

a 23 

a 3X 

a 32 

a 33 


a 3l 

a 32 

a 33 


in winch the second determinant on the right is what is 
called a bordered determinant, if we regard it as being 
formed by bordering |a 22 ^ 33 ! with a prefixed row 
[0 a 12 a 13 ] and column {0 a 2l a zl j. It is evident that such 
an identity holds for a determinant | A | of any order. * 
Now let us expand the second term on the right of 
( 1 ), the bordered determinant, according to elements of 
its first row a 12 , a 13 , a ln . Every cofactor |^ 12 |, |/1 13 |> 
\A ln \ contains the elements a 2V a zi , a nl of the first- 
column. Let us then expand all those cofactors in their 
turn according to these elements a 2V a 31 , a nl , denoting 
the cofactor of o n in \A tj \ by \A ^ : a |/that is, — |.A 11; #j. 
The aggregate of terms so obtained is 

M- 1 = %i|^n| SEciftdij |-4 ii* jj\, i, j 1 . . (2) 

i j 

This is an expansion of \A\, due to Cauchy, in terms 
of elements of a row and of a column and the joint cofactors, 
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which apart from, the first term will be signed minors of 
order n—2. Clearly the expansion need not be restricted 
to elements of the first row and first column; a corre- 
sponding expansion in terms of the h ih row and k th 
column could have been established; namely, 

Ml == £j | j i =£ Tiy j k. . (3) 

i j 

The reader may easily construct another proof of this 
important expansion by an enumeration of terms thus : 
first show that all the terms in the expansion are different 
and are also terms of J, then show that the number of 
terms arising is (w— 1)1 + {n~l) 2 3 4 5 (n~2)\ i which reduces to nl. 

1. Let a matrix A of order nxn be bordered by a first 
row {0 x x x 2 ... x n } and a first column [0 y x y 2 ... y^\. Then 
the expansion of its determinant in terms of .the elements of 
the first row and column will be 

. . . ( 4 ) 

i 3 

a bilinear form which in matrix notation would be written 
as — ^'(adj A)y* Thus if #'(adj A)y is called the adjoint or 
adjuqaU bilinear form of x'Ay, we have it in the form of a 
bordered determinant 


2. If A is symmetric and y = x the above bilinear form 
becomes the adjoint or adj agate quadratic form of x'Ax, 
namely, a/(adj A)x . If A is Hermitian and y = x, we obtain 
the adjoint Hermitian form of x'Ax. 

The reader should try some numerical examples of low order. 

3. If A = I, the quadratic form and the adjoint quadratic 
form are the same, namely, the sum of squares x'x. 

4. If A is symmetric and nonsingular, express the 
reciprocal quadratic form x'A~ x x as a bordered determinant. 

5. Evaluate several determinants of the 4th order with 
numerical elements by Cauchy’s expansion in terms of 
elements of a row and of a column. -Observe carefully the 
signs of the terms, and check by alternative evaluations. 
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32. Complementary Minors : Algebraic 
Complements or Cofactors 

Let us take the general determinant \A j of order n and 
partition off its first m rows and first m columns. The 
remaining n—m rows and columns constitute a submatrix 
of order (n—m) x (n—m) which would provide a minor of 
order n—m, namely, 

\ a m+lvm+l S+2m+2 *'* a nn\> • (i) 

as represented by its diagonal elements. Now the co- 
efficient of <%# 22 ---GW * n M I evidently 

£i ( 2'm+l,crQ'm-h2,T ••• Unv} * ♦ • ( 2 ) 

where, since the row suffixes 1, 2, ..., m and column 
suffixes 1, 2, m are in natural order, the sign of any 
term depends on the permutation (ot...v) of the natural 
order (m+1, m+ 2, ..., n). But the aggregate of such 
terms is merely the minor (1), which is said to be the 
complementary minor of \a n a n ...a rilm \. In general two 
minors of orders m and n—m formed from complementary 
sets of rows and columns in A are said to be complementary 
with respect to A. 

Next, while leaving unaltered the last n—m columns 
of A, we may systematically permute the first m columns 
in such a way as to bring a lcc a 2 p ... a m[L into the 
diagonal, where (a/?...^) is any assigned permutation of 
(123. ..m). Such operations merely affect \A\ with the 
sign proper to this permutation. It follows that the 
complementary minor \a m+1 , m+1 a m+2 , n +»...a nn \ is not 
merely the coefficient of a ll a i2 ...a mm in \A\; it is the 
coefficient of „ , 

••• a mfx> * • * ( 3 ) 

that is, of |a 11 a 28 ...a i?m |, its own complementary minor. 
In the same way, as we can see at once by bringing any 
minor of order m into leading position, the product of any 
two complementary minors, affected with the sign produced 
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by bringing either of them into leading position, yields 
terms occurring in \A\. 

To discover this sign, consider the minor taken from 
rows i v i 2 , ..., i m and columns j l3 j 2 , ..., j m of A } both of 
these sets being in the usual ascending order. By 

(h~~l)-h(h—2)+ ... . . (4) 

interchanges of rows, and 

OW)+(j 2- 2 )+ - • • (5) 

interchanges of columns such a minor can be brought into 
leading position. Hence in the expansion of \A\ the cor- 
responding product of complementary minors must take on 
the sign factor due to 

h +h + h +j*2 + * • * +iw — m ( m + 1 ) 

sign changes, and since m(m-f-l) is even it can be discarded. 
Hence the desired sign factor is 

(— )h+h+h+h+---+im+3m . . . ( 6 ) 

An alternative way of finding the sign is by inspection 
of diagonal suffixes. For example an d its 

complement |a 31 <z 53 [ show row suffixes (12435), column 
suffixes (24513). The relative class (16) is even, and so 
the desired sign is positive. 

The complementary minor of a given minor, with the 
above sign prefixed, is often called the algebraic complement 
of the given minor in \A\. There is no reason why it 
should not be called its cofactor in \A\, and we shall adopt 
this name. Any minor, with its sign qua cofactor prefixed, 
may also be called a signed minor , as in 19. 

Leaving a minor in its original position in \A\, obliterate 
the remaining elements of \A\. The rows and columns thus 
rendered blank constitute a submatrix. Insert unit elements 
successively in the vacated rows so that this submatrix now 
becomes a unit submatrix. Prove, by bringing the sub- 
matrix of units into leading position, that the determinant 
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so obtained is equal to the signed minor corresponding to the 
original minor. For example 

1 . . . 

a Zl * a 23 

1 . 

. . .1 

a Sl * a 5Z ‘ 

33. The Eaplacian Expansion of a Determinant 

Let us take any m rows of \A\ ; no generality will be 
lost by taking the first m rows. From these we may form 
tt (m) minors of order m, where n ( m) = n(n — 1 ) . . . (n —m + 1 ) /ml . 
Multiplying each minor by its cofactor, a signed minor of 
order n~m, we obtain terms of \A |. None of these terms 
duplicates any other, for they involve different selections 
of rows and columns, and so each term has a different 
arrangement of suffixes from every other term. Counting 
up all the terms of \A\, so arising, we have altogether 

n\ 

n {rn) m\(n~-m)\ = — = n\ . (I) 

m\[n — m)\ 

terms. Hence we have all the terms of \A\ without 
omission or repetition. 

This type of expansion of \A\, according to minors from 
a certain m rows and their cofactors from the remaining 
n— m rows, was first given by Laplace in 1772. Since 
\A'\ = \A\, it is evident that expansion by minors and 
cofactors from complementary sets of columns is equally 
possible. 

1" \ a i^2 C Z^4,\ — | a i^2| | C 3^4|”"l a l C 2| f^S^I |^3 C «I 

| a 3^4|“"|^A| I a 3^4p 

by a Laplacian expansion according to the first two and the 
last two rows of the determinant. 

The reader is invited to write down other expansions with 
various choices of two rows or columns and their 
complementaries. 


°21 

^23 

«51 

a 53 
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2. Construct various determinants of order 4 with 
numerical elements and evaluate them by Laplacian 
expansion, checking the results by Cauchy’s expansion. 

A quasi-Laplacian expansion, according to minors 
from a certain set of rows (the same applies to columns) 
and the cofactors of corresponding minors not from that 
set of m rows but from another set of m rows, vanishes 
identically. This is the expansion Jby alien cofactors. The 
reason is that in any such expansion the elements of at 
least one row of | A ] must occur twice, once in the minors 
of order m and once in the cofactors of other minors by 
which these are multiplied. Therefore the expansion is 
that of a determinant derived from \A\ in which at least 
two rows are identical ; and such a determinant vanishes. 

This result is the natural extension of our earlier one, 
namely that an expansion according to elements of a row 
of \A\ and alien cofactors (21) vanishes identically. 

3. In the determinant \a x 6 2 c 3 d 4 e 6 / 6 | put [a 4 & 4 c 4 d 4 ], 
[h '5 c 6 dg], 6 g c 6 equal to [cq c i [oq 6 2 c 2 d 2 ], 
\a z b 3 Cg d 3 ] respectively ; also put {e 4 e 5 e 6 } and {/ 4 f 5 /,} 
equal to zero. The determinant then vanishes. Expanding 
it in terms of its first three and last three rows, we derive the 
identity 

I a l^2 c 3 1 |^i e 2 fz I “ | a 1 \ C l&2 /s| + | a i C 2^3 1 1 ^l e 2 fz | “ I ^1 C 2^3 1 \ a i e 2 fz I 

= 0 . 

4. By modifying a determinant | 2 ° 7 ^ s 1 hi a 
similar way, so that the last four rows repeat the first four 
except for the fact that {g 5 g 6 g 1 g s } and {h 5 h 6 h 7 h 8 } are null, 
and by expanding the resulting determinant in terms of its 
first four and last four rows, we obtain an identity not unlike 
the above, but of 15 terms, each being a product of two 
determinants of the 4th order. The reader should obtain 
this identity. 

This method of Laplacian expansion applied to vanishing 
determinants may be made to yield many such identities in 
sums of products of determinants. The identities are of 
importance in the theory of algebraic invariants. 
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Before we pass to more general expansions of Laplace 
and Cauchy type, we may prove by the ordinary Laplacian 
expansion the fundamental theorem of multiplication of 
two determinants. 


34. Multiplication ol Determinants 

The multiplication theorem for determinants can be 
expressed in relation to matrix* theory as follows : 

The determinant of the product of two square matrices 
is equal to the product of the determinants of those 
matrices ; and similarly for the product of any number 
of square matrices. In symbols, \AB...K\ = \A\ \B\ ... [isCj. 

Proof. Consider two square matrices A and B of 
order nXn, and take them as diagonal submatrices of a 
partitioned matrix of order 2n X 2n ; of the remaining 
non-diagonal submatrices let the one above the diagonal 
be null and the one below the diagonal be —I. The follow- 
ing identity is then readily verified, all matrices being 
conformably partitioned : 



(The reader is invited to write this identity in full 
non-partitioned form, taking for A and B square matrices 
of the 3rd order, and to refer to this illustration at the 
various stages of the proof.) 

Now the operation of premultiplication on the left of 
(1) is of a kind already encountered (19, Ex. 23) and is 
equivalent to row operations which do not change the 
value of a determinant. Hence, taking determinants, we 
have 


A . 
-I B 


. AB 
-I B 


( 2 ) 


and expanding each side by a Laplacian expansion 
according to the first n rows, we have the desired result 

\A\\B\=\AB\, . . . (3) 
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the positive sign of \AB\ being assured by an inspection 
of leading terms in \A |, |JB|, and \AB\. We deduce at once 

\ABC\ = \A\\BC\~\A\\B\\C\,. . (4) 

and so on for any number of matrices of order nxn. 

Not only so, but since | A' | = \A\ we have the equally 
possible equivalents \AB ’ |, \A’B\, \A'B’\ for \A\ |i?[ ; in 
other words, determinant multiplication can be carried 
out row into row, column into column, and column into 
row, as well as by the customary row into column way of 
matrices. 


1. The reader should here take various determinants of 
the 3rd and 4th order which he has already evaluated for 
practice, should multiply their matrices in the four ways 
just mentioned and should verify that the determinants 
of the results are indeed the products of the separate 
determinants. 

2. Since ^4(adj A) = \A\I we obtain, taking deter- 
minants, \A\ | ad j A | = \A\ n . Hence if A is nonsingular 
|adj A | = \A\ n ~\ a theorem due to Cauchy. The theorem 
expresses a polynomial relation which persists even when it 
is singular. In that case \A\ = 0, |adj A\ — 0. 

3. Prove that sums and products of matrices of order 
2 X 2 of the type 


r a n a i2i 

L— d 12 a.A 


are matrices of the same type. By taking the determinant 
of a product AB of such matrices, prove that the product 
of two sums of four squares can be expressed rationally in 
certain ways as the sum of four squares, and find those ways. 


35. Extensions of the Laplace and Cauchy 
Expansions 

In the terms of a Laplacian expansion the minors of 
order m, or their cofactors of order n—m, can themselves be 
further expanded in Laplacian fashion, as a result of which 
we have an aggregate of terms, each of which is a product 
of three non-overlapping minors ; and these in their turn 
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can be further expanded. We arrive in this way at the 
following generalization of the Laplacian expansion : 

Suppose that the n rows of A are partitioned into a 
certain n x rows, then another n 2 rows, ... and finally 
the n k rows left over, where 


^ 1 +^ 2 + — n - 

The rows in any set need not be consecutive rows in A . 
Let minors, of respective orders n x , n 2 , ... n ki be taken in 
all possible ways from the respective sets of rows ; and let 
products be formed containing one minor from each set 
in such a way that no column of A enters twice, in other 
words, all columns are* used, sign being given by con- 
sideration of the diagonal elements of minors in each 
product. Then the sum of all possible such signed products 
of h minors is an expansion of \A |. 

If this were not otherwise evident from the description 
of the way in which this expansion is derived by repeated 
Laplacian expansion, it could be made evident by enumera- 
tion of terms ; for the number of terms of | A | arising from 
expanding all these products, which themselves number 

n M ( n - n l\n,) - («-%- - -«*-l \ nk y (1) 

will be 


n! 


(»-%}! 




n 2 !(^“-n 1 — n 2 )I 




%!%!... ftglsrn!, (2) 


and this is the number of terms in \A |. 


In particular, if h = n and n x ~ n 2 — ...=%= 1, 
the expansion is merely the sum of n\ signed products 
of single elements, the defining expansion of \A\. 


1. Construct some determinants of the 5th order with 
numerical elements. Partition the rows or columns in various 
ways according to the partition 2, 2, 1 of 5 and evaluate the 
determinants by the extended Laplacian expansion, cheeking 
the results by alternative evaluations. 
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2. Every expansion of \A\ that we have so far used is 
equally applicable to the permanent of A, the sole difference 
being that all terms and all permanent minors receive positive 
sign. The reason is that the argument used to demonstrate 
the expansions is an enumerative one, which applies equally 
well to permanents. For example 

+ + 

.+ + + + + + 

^2 ^2 ^ 1 ^ 2^3 j | | *4"^1 j ^ 2^3 f • (3) 

&3 &3 C 3 


and other similar expansions. 

Extended Cauchy Expansion. Cauchy’s expansion 
of | A | in terms of the elements of a row and of a column, 
which is in essence the expansion of a bordered determinant, 
can be extended to the case where m rows and m columns 
are used in bordering. The bordered determinant is then 
of the form 


. X 
Y' A 


where X = 





for example, and Y resembles X. In the general case A is 
of order nxn, X of order mxn and so Y' of order nxm. 

It is evident by Laplacian expansion according to the 
first m rows of (4) that ifm — n the value of the bordered 
determinant is (— ) W |X| |F|, and that if m>n the deter- 
minant vanishes. 

When m<n a Laplacian expansion of n im) terms is 
possible, by minors of order m in X each multiplied by its 
cofactor of order n in the bordered determinant. Further, 
every cofactor contains all the elements of Y\ Let us 
expand in turn all of these cofactors according to the 
first m columns containing Y\ When this is done the 
previous Laplacian expansion yields an expansion in which 
every term has three factors, the first being a minor of 
order m taken from X, the second a minor of order m 
taken from Y', and the third a minor of order n—m taken 
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from A and involving those rows and columns of A not 
included in the other two minors. The sum of all possible 
products, (n im) ) 2 in number, which can be formed in this 
way, correct sign being given by reference to diagonal 
elements in the factors, constitutes an expansion of the 
bordered determinant which generalizes the ordinary 
Cauchy expansion according to one row and one column* 

A further extension of similar nature concerns a 
bordered determinant resembling that in (4), but with A 
of order nxp, X of order mxp, m<P> Y of order 
nx(m+n—p). 

1. Show that the number of terms in the final expansion 
in products of elements of the bordered determinant is 
( nl) 2 /(n—m)l, and show that the number of terms given by 
the extended Cauchy expansion is the same as this. 

2. The extended Cauchy expansion applies equally well 
to a permanent, all products receiving positive sign. 

3. Expand by the extendod Cauchy expansion 

x x x 2 x a 
u x u 2 u a 

Ci f 

Oj 2 b 2 C 2 

^3 b a c a 

paying careful attention to the sign of the nine terms. 

4. In Ex. 3 put A = —I and observe the form of the 
result. Also put A = —I and Y = X and again observe the 
form of the result. Generalize these results. 

5. In Ex. 3 give numerical values to the elements in 
various ways, and evaluate the bordered determinants thus 
constructed. 

6. Write out completely the expansion the first term of 
which is given below, paying careful attention to the sign of 
terms. 


. 

% 

x 2 

x 3 

^4 

* 

% 


u 3 



«1 

a 2 

a 3 

«4 

V 2 

*1 

b 2 


t> 4 

*3 

Ci 

C 2 

C 3 

C 4 


. X 
Y' A 


Vl 

Vl 

y% 

v 2 

Vz 

Vs 


= \ x i u Mb 3 c i \+ ... . 
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36. Determinant of a Product of 
Rectangular Matrices 


Consider AB , where A is of order mxn and B is of 
order nxm. The determinant \AB\ is of order m. We 
can evaluate it by making partitioned matrices exactly as 
in our proof of the multiplication theorem of determinants 
(34 (1)), and using the same identity 


I A 

r a .i _ 

. AB 

.. 7 

[-1 A - 

-I B _ 


We have therefore 


A . 

1 • ab\ 

-I B 

II 

i, 

to 


. a) 

• (2) 


exactly as before, but since A and B are no longer square 
but rectangular the expansions of these determinants need 
more attention. 

The Laplacian expansion of the determinant on the 
right is \AB\y with sign factor. The determinant on the 
left can be expanded by the extended Cauchy expansion, 
A and B being regarded as bordering — I. It may help 
to keep under view the particular examples given below. 


a Xl a i 2 * 


a ll ^12 a i 3 • 

&21 ^22 


^21 ^22 ^23 * 

a 31 °32 

* (ii) 

—1 . . b n b 12 

1 * ^11 ^12 &13 


. — 1 . b 21 b 22 

— 1 b 2i &23 


. . 1 632 


In the cas e m m>n the expansion is identically zero, 
as we have seen. Hence if m>n we have \AB\ — 0. 

1. This could equally well be proved by adding m—n zero 
columns to A to make a square matrix [ A 0], likewise m—n 
zero rows to B, and observing that 


[-4 0 ] 



= AB. 


The ordinary multiplication theorem of determinants now 
gives \AB\ =0. 
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On the other hand, if m<n, since the only nonzero 
minors of any order in/, or — /, are its diagonal or principal 
minors of that order, the Cauchy expansion (35, Ex. 4) in 
question consists of these minors of order n—m multiplied 
by their cofactors in A and B together. From the symmetry 
of —I, if the cofactor in A comes from a certain m columns, 
the cofactor in B will come from the corresponding m rows. 
For example, fixing sign by leading terms, we have in (ii) 

\AB\^\a lx a 22 \ liu^l+I^iAsI \Wi^z 2 \^\ a i% a 2 z\ (4) 


The general result may be stated thus : 

If A is of order m^n, and B is of order nxm> 
the determinant \AB\ is equal to the sum of all the n (m) 
products which can be made by taking a minor of order 
m from a certain m columns of A and a minor of order m 
from the corresponding rows of B . If m>n, \AB\ = 0. 

The ordinary multiplication theorem of determinants, 
already proved, is the special case m = n. 

This fundamental theorem concerning the determinant 
of a product of rectangular matrices was discovered 
independently by Binet and Cauchy (1812) and is usually 
called the theorem of multiplication of determinantal 
arrays . It can be extended step by step to the case of the 
determinant of the product of any number of matrices, 
provided that the product is square. The enunciation of 
this very general theorem is as follows : 

Let the determinant formed of elements common to 


rows a v a 2 , ..., a n and columns /? 2 , f$ n of a matrix 
T be denoted by 

m [ ^2 \ /z\ 

J \ Q. Q R * * * * W 


a l a 2 a n\ 

A A --.ft/' 


Then if T = AB...BS, where A, B, R t 8 are of order 
kxm,mxn 3 ..., r X s, s X k respectively, so that T is of order 
kxh then 


T 




.R 


( Pipa*-*Pjfc 


)■'■( 


I 2. 
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where the summations are over all sets of k columns taken 
independently from the columns of A, B, R ; or, alter- 
natively, over all sets of k rows taken independently from 
the rows of B, ..., R , S . The number of terms in the 
expansion is therefore 

m (.k) n Uc)--- r Uc) s (k)> • * * (?) 

and the expansion will vanish identically if any of the 
following inequalities be true, 

k>m , 


2. Show that 


. X' 
X A 9 


where X is of order mxn, m>n, and A is symmetric, is a 
certain quadratic form, with the minors of X of order n as 
variables. If m = 4, n = 2, writ© down the matrix of the 
quadratic form. How is it formed from A ? 


37. Expansion of Determinant by Diagonal Elements 


Let us consider the determinant of A +X, where X is a 
diagonal matrix X = [»«] ; for example, to take the case 
n = 4, — | 


^+X = 


a ll+^ll 

a i2 

CO 

iH 

e 


^21 

a 22~\~ X 22 

a 23 

a 2 i 

a 3 1 

a 32 

CO 

+ 

eo 

«34 

a A1 

a 42 

a 43 

Ct U + X - 


( 1 ) 


The determinant is evidently a polynomial in 

z lv x 22> x 23> ^44- Expanding in terms of products of these 
variables and their coefficients or cofactors, we have 


\A+X\ 


+ 

+ 

+ 


*^11*^22*^33*^ 4 4 “f - ^11^22^33^44 ^^22*^11*^33*^44 “h"* * * 
(3 'll Cfc 22 1*^33^44 4” 1^11^33 \ X 22 X 4& “t“ ** * / 

«ll®aa a 33|®4 + |% 1 ^ 22 a 44^3 + *-- * ‘ ^ 

^ 11 ^ 22 ^ 33^4 4 1 * 


In general the expansion consists of a sum of products 
of x u taken m at a time, each product to be multiplied by 



88 


EXPANSION OF DETERMINANT 


the complementary principal minor of order n—m in \A J ; 
the sum being taken over such products for all values of 
m from n to 0. Permanents admit a similar expansion. 

1. Expand the determinant \A\ of the 4th order in this 
diagonal fashion by regarding its matrix as 


— 





"" — 

. 

«12 

a l3 

°14 


a ll • 

®21 

• 

a 23 

a '23 

+ 

. a 22 * 

a 31 

a>zi 


a 34 

a 35 

«41 

«42 

«43 

* 


. . . a 4i 


and expanding in terms of products of the a {i . 

2. Try this mode of expansion on some determinants 
of the 3rd and 4th order with numerical elements, checking 
the result by evaluation in other ways. 


Spur or Trace of Matrix. A specially interesting and 
important case arises when X is not merely diagonal but 
scalar, X = XI. We then derive from (2) the expansion 

\A-\-\I\— A n 4-spiA. X n 1 +sp 2 ^- A n ~~ 2 -i- •** 4~ 8 P W — Ad- |A|, (3) 

where sp r A means “ sum of all principal minors of J A J of 
order r The first of these sums, sp X A, the sum of the 
diagonal elements a u themselves, is important in matrix 
theory and is often called the spur or trace of A . We shall 
denote it by sp A. (Some authors employ tr A.) 


3. Expand 


in powers of A. 


A — a —6 
a A — c 
be A 


4. Let <f>( A) - 1-4.— AJj = ( — )«(A — A x )( A— A 2 ) ... (A — A n ). 
We may also expand <j>( A) in powers of A, in the manner 
of (3). Equating coefficients of powers of A, we deduce that 
sp*4 is the sum of products (c/. 48) of A, taken k at a time. 
In particular spA is the sum, and \A\ the product, of the 
latent roots. Hence if any A^ is zero A is singular ; and 
conversely. 
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* Normal or Canonical Form of a Matrix. In the 

next few examples we assume the A,- to be all distinct. 
The case of repeated roots is important, but beyond 
our scope. 

5. The latent vectors u or q of A are linearly independent. 
For if there were a relation with nonzero c j9 such as 

p = c 1 ^( 1 + c 2 ^(2“f' ••• r\-c k q^ k ~ 0, 
then taking p 9 Ap, A 2 p 9 A^^p we have by 30, Ex. 11, 

+c 2 A2<2 r ( 2 + ••• -^rCjc^kQiTc = 0, s = 0, 1, h — 1. 
These are sets of homogeneous equations in corresponding 
elements of the q (j . For them to have nonzero solutions 
it is necessary (28) that | A^A^ — a|^ x | = 0. But (19, Ex. 11) 
such a difference-product of distinct A y cannot vanish. The 
assumed relation is thus impossible. 

6. The u (i and q are thus rows and columns of non- 
singular matrices U and Q, and so the respective sets of 
relations u (i A = A iU ({9 Aq (f — A can be written as ?74, = AU, 
AQ — QA, where A is the diagonal matrix [A J. 

7. Hence U A U* 1 — A = Q-'t-AQ, so that A has been reduced 
by the similar transformation (of type HAH _1 ) to diagonal 
canonical form A. (For multiple A y this may not he possible.) 

8. We may prove that u (i q {i ^ 0, u {i q u — 0, i=£j. For 

U(iAq {j — A iU^q^ — A jU^qy = 0, since A^A^. Hence the non- 
diagonal elements of UQ are zero. Since UQ is nonsingular 
the diagonal elements u^q^^ 0. Fixing the arbitrary scalars 
in and q (i so that = 1, we have UQ = I. 

9. If A is Hermitian each u = g', by 30, Ex. 8. Hence 
U == and so UQ = J yields = I, whence Q is unitary. 
It follows that $'4LQ = A, a unitary reduction to real 
diagonal canonical form ; real, since the latent roots (30, 
Ex. 9) are real. In particular if A is real it is symmetric, and 
Q is orthogonal. 

The quadratic form x'Ax thus becomes y' Ay , a form in- 
volving squares only, if y = Q” 1 # = Q'x. (Orthogonal reduc- 
tion of a central quadric to principal axes.) 

10. The relations ip{A)q^ = 0(A s )q u of 30, Ex. 11 can bo 
assembled, in view of 7, Ex. 10, into a single matrix relation 
tfj(A)Q ^Qi)j(A). Putting 0(A) = 0(A) = \A — AZj, we have 
0(4 )Q — 0. Since Q is nonsingular, 0(4) = 0. This is the 
Cayley -Hamilton theorem, valid also for multiple roots. 

* In a first reading these examples may be omitted. 
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COMPOUND MATRICES AND DETERMINANTS: 
DUAL THEOREMS 

, 38. Compounds and Adjugate Compounds 
of a Matrix / 

The elements of adj A are the cofactors- of elements a H 
in \A\ ; and the special properties of adj A are intimately 
related to the expansion of \A\ according to elements of 
a row or of a column and their cofactors in \A\. The 
more general Laplacian expansion, in terms of minors of 
order k taken from a certain k rows or columns and their 
cofactors of order n— k, suggests the following extension of 
the adjugate relationship. 

Compound Matrices. Let a matrix be formed the 
elements of which are minors of [^4 1 of order lc ; let all 
minors which come from the same group of k rows (or 
columns) of A be placed in the same row (or column) of 
this derived matrix ; and let the priority of elements in 
rows or columns of this matrix be decided on the principle 
by which words are ordered in a dictionary or lexicon. 
For example, minors from rows 1, 2, 4, of A will appear 
in an earlier row than those from 1, 2, 5 or 1, 3, 4 or 2, 3, 4 ; 
and similarly for columns. We shall call this order lexical 
order. (The usual name is “ lexicographical ”.) The 
matrix with elements minors of order k constructed in 
this way will be called the k ih compound of A and will 
be denoted by A (k K It will be defined in the same way 
even when A is rectangular of order mxn , submatrices 
of order kxk being extracted from A and their deter- 
minants being taken as the elements of A ik) , lexical 
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ordering being observed. The order of A (k) will then 
be m ik) xn (k) . 

Ad jugate Compound Matrices. When A is square 
of order n X n, every minor of order km\A \ is accompanied, 
in a Laplacian expansion of | A\, by its cofactor or signed 
minor of order n — k. Let us take the h th compound 
A ik \ replace every element in it by its cofactor in [A |, 
and transpose the resulting matrix. We thus obtain a 
matrix of the same order as A a \ which may be called the 
k th adjugate compound of A, and will be denoted by adj 
We may notice in passing that A (1) =^ A> adj (1 h4 = adj A, 

1. Take the general matrix of order 5x5 and write out 
in extenao, indicating minors by diagonal elements, the 2nd 
compound and 2nd adjugate compound, and the 3rd com- 
pound and 3rd adjugate compound, comparing the 2nd 
adjugate compound with the 3rd compound, and the 3rd 
adjugate compound with the 2nd compound. The comparison 
will reveal three features of difference. 

2. The k th compound of I is a unit' matrix. The k ih 
compound of a diagonal matrix D is a diagonal matrix, with 
the Jc-axy products of the diagonal elements d ii9 in lexical 
order, as diagonal elements of 

3. The k th compound of A' is (. A <*>)'. 

4. The k th compound of a Hermitian matrix is Hermitian. 

5. The k th compound of a skew symmetric matrix is 
symmetric if k is even, skew symmetric if k is odd. State and 
prove the analogue for skew Hermitian matrices. 

Next let us consider the products 

A™ (adpU) and (adj ™A)A<*\ 

Every element in either product, in view of the way in 
which adj {fc) -4 has been defined, is a Laplacian expansion 
according to minors of order k (in the one case from rows 
of A y in. the other from columns) and their cofactors, 
true or alien, in \A\. In the diagonal elements of the 
products the cofactors are the true, in the non-diagonal 
elements they are the alien cofactors. Hence in each 
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product all non-diagonal elements are zero and all diagonal 
elements are equal to \A\, so that 

A™ &&]<*>A = adj ik) A . A™ = \A | /, . (1) 

all the matrices concerned being of order n [k) . Taking 
determinants we have a theorem due to Cauchy, 

\A™\ |adj<«4| = \A\ n ™. . . . (2) 

Compound Determinants, Sylvester’s Theorem. 

Now \A ik) \ and |adj (fc bd| are evidently both polynomials 
in the elements of A, of degrees n a) k and n {k) (n—k) 
respectively ; and | A ] is itself a prime polynomial. It 
follows that both |-A (fc) | and |adj (fc) J.| are powers of \A\ f 
the possible numerical factor being seen to be 1, either by 
consideration of the term arising from the diagonal elements 
or merely by putting A = A/. Further, since \A j is of 
degree n and \A {k) \ is of degree while |adj <A;, J[| is of 
degree n a) (n—Jc), we have 

|4®| = |adj<«4| = (3) 

using n m hjn = [n— l) {k - x) , and n (k) {n-k)/n — (n—l) a) . 

The theorem in (3) on the value of the k tn compound 
determinant was given by Sylvester in 1853. An important 
consequence is that if A is nonsingular so is A ik) , while if 
A is singular so is A (k) . 

6. If k = 1, we have from (3) that |adj A\ = 
which is Cauchy’s theorem on the value of the adjugate 
determinant. 

7. Jf t A is rectangular, of rank r, the (r + l)** and higher 
compounds of A are zero matrices, but A (r) ^ 0. 

8. = lAl^hn-fc-i) = |adj ( *hl|. 

This is not surprising, in view of the fact (of. 39, Ex. 3) 
that the (n~-Jc) th compound and the k ih adjugate compound 
have the same elements, namely, minors of JA| of order n— k t 
but in the case of the adjugate compound they are transposed, 
written in the complete reverse of lexical order, and have 
cofactor sign imposed. 
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39. Binet-Cauchy Theorem on Product of 
Compound Matrices 


We shall next prove a theorem of central importance, 
first given, in special cases, by Binet and Cauchy in 1812 : 
the k th compound of a product matrix AB is identically 
equal to the product of the k th compounds of A and B in 
that order. 

Let A and B be rectangular matrices of orders mxn, 
nxp , and let us consider A {k \ B {k \ and (ABy k K 

Any element in AB is the product of a row vector of A 
into a column vector of B . More generally any submatrix 
of order kxk in AB is the product A 1 B 1 of the submatrix 
A x consisting of a certain k rows of A } of course not neces- 
sarily consecutive, and the submatrix B x consisting of a 
certain k columns in B . This is most readily seen by 
supposing A partitioned into {A x A 2 } according to those 
k rows and the remaining m — k, and B into [B x B 2 ] accord- 
ing to those k columns and the remaining n—k. For then 


we have 



IBi BJ 


A X B X A X B 2 
A 2 B x A 2 B 2 


( 1 ) 


and A X B X is the submatrix comprised by those k rows and 
k columns in AB. ( Gf . also 11.) 

Now by the theorem on the determinant of a product 
of two rectangular matrices (36 (4)) it follows that \A 1 B 1 J, 
which being a minor of AB of order k is an element of 
(AB) a) , is the sum of products of minors of order k drawn 
from k of the n columns of A x in all possible ways and 
minors of order k drawn from corresponding sets of k rows 
of B x . But the minors from A x are all elements of a row 
of A ik) and the minors from B x are the corresponding 
elements of a column of B i7c) , and so the sum of products 
in question is an element of A {k) B {k \ by ordinary row- 
into-column multiplication. This is true for every element 
of A ik) B {k) , and the priority of elements in rows and 
columns is in strict accord with the lexical convention. 
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The conclusion is that the matrices (. AB) m and A {k) B m 
are element for element identical. Thus 

and so step by step we derive the general result for the 
compound of the product of any finite number of matrices. 

The elegance of this theorem is equalled only by the 
wealth of its applications. Since it rests on the theorem 
of Binet and Cauchy concerning the determinant of a 
product of rectangular matrices, and gives an extended 
matrix expression to that theorem, we shall call it the 
Binet-Cauchy theorem. 

1. The more special theorem concerning the determinant 
of a product of rectangular matrices is that particular case 
of the Binet-Cauchy theorem in which A has h rows and B 
has h columns. Then (AB) ik) and A ik) B ik) each consist of a 
single element. 

The reader should take matrices of low order with litoral 
and with numerical elements, and should familiarize himself 
with the import of these theorems. 

2. Prove that (adj A) {k) = A adj lk) A, 

where A is a scalar, and show that A is a certain power of 
\A\. (Premultiply both sides by A ik) .) 

3. Prove that if A is square of order nxn then 

adj A = H(A'y n ~UH-\ 

where H is a matrix with all elements zero except those in 
the secondary diagonal at right angles to the principal diagonal, 
these elements being 1 and — 1 alternately. 

4. By taking the Jc ih compound of A'A = I and applying 
the Binet-Cauchy theorem prove that the k th compound of a 
unitary matrix (24) is unitary. 

Adjugate Binet-Cauchy Theorem. There is a 
theorem of the same nature as the Binet-Cauchy theorem, 
relating to the k ih adjugate compound of a product. From 

38 (1) ^<*>£<*>(adj<*J£)(adj<*>4) = \A\\B\I, . (1) 
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and we have also 

(AB) {fc, adj (AB) = \AB\I. . . (2) 

Since \AB\ = \A\ |JS| and (AB)™ = A^B™ we derive 

adj (fc) (AB) = adj (?i:) j& . adj {k) A, , . (3) 

and so step by step 

adjW(^j?C)==adj<% . adjW(AB)=adj<*>0 . adj ( *>£. adj<*>4, (4) 

and so on. The reversed order is to be noticed. The 
theorem expresses a set of polynomial identities in the 
elements a i$ , which persist when either or both of A and B 
are singular. 

We may call the theorem the adjugate Binet-Cauchy 
theorem. 

40. The Reciprocal of a Nonsingular 
Compound Matrix 

If A is of order n X n and \A \ ^ 0, we have 
AA^—A^A—I. Applying the Binet-Cauchy theorem 
we have )») = = /, . . (l) 

where I is of order % {k) . It follows that 

(jLW)-i = (A- 1)(», . . . (2) 

in other words the reciprocal of the Jc th compound of A is 
identical with the k th compound of the reciprocal of A. 

In the same way the adjugate Binet-Cauchy theorem 
yields the result that the reciprocal of the k th adjugate 
compound of A is identical with the k th adjugate compound 
of A- 1 . 

Add to these properties the almost intuitive ones that 
the k th compound of a Hermitian matrix is Hermitian, that 
the k th compound of A' is (A (Jc) )', and that the k th compound 
of a unitary matrix is unitary (39, Ex. 4) and it becomes 
clear that there is a far-reaching parallelism between the 
properties of a matrix and those of its compound or 
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adjugate compounds. In fact, from any identity or equa- 
tion involving products of matrices or reciprocal matrices 
we are able, by taking k th compounds of every factor, to 
deduce a parallel identity or equation which is often more 
profound than the original one. 


41. Rank of Matrix Expressed by 
Compound Matrices 

Let A be of order mxn and of rank r. This implies 
that all minors formed from submatrices of order 
(r+l)x(r+l) vanish, while at least one minor of order r 
does not vanish. But this statement can be expressed 
more concisely in the form 

A'*' # 0, A (r+1) = 0. . . . (1) 

Rank of Product Matrices. The Binet-Cauchy 
theorem now enables us to deduce at once certain important 
theorems on rank. 

(i) Bank is invariant under multiplication by a non- 
singular matrix. 

To prove this, consider A and HAK, where A is of rank 
r and \H\^0, |Z|#0. Then 0, A«+» = 0. Also 

(HAKy r +» = H^A^K^ = 0, . . (2) 

since A (f+1) = 0. On the other hand, 

(HAK)M = 0, . . (3) 

otherwise we could divide out by the nonsingular matrices 
H ir) and K ir) and obtain A (r) = 0, contrary to hypothesis. 

Hence HAK is of rank r, as was to be proved. By 
putting K = I we have the result for premultiplication 
alone ; and with H =* I for postmultiplication alone. 

The following theorem concerning the rank of a product 
matrix is proved with equal ease : 

(ii) The rank of a product matrix AB cannot exceed the 
rank of either factor A or B. 
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To prove this, suppose that the ranks of A and B are 
respectively r and s, where Then by the Binet- 

Cauchy theorem 

{ABY r + 1) = ^(r-l)j 5 (r+l) = 0> _ . ( 4 ) 

since A (r+1> = 0. Thus the rank of AB cannot exceed r, 
and a similar proof holds if 

1 . Take the matrices 


'1 . 


"1 . / 


rH 


- 

r 

. 1 . 

. . 1 

’ 

. 1 . 

9 

. . 1 

i 

1 . . 

-* 1 •_ 

y 


form powers and products from them in various ways and 
note the rank, comparing it with the ranks of the factors. 

Make experiments with other examples, choosing 
rectangular matrices of low order. 

2. A matrix such as 


U = 



which satisfies an equation U* — 0 is said to be nttpotent. 
A matrix M such that M 2 = M, and so M* = M for all 
positive integers &, is said to be idempotent. 

Prove that the matrix M = n~ x N, where N is of order 
nxn and has every element n i5 = 1, is idempotent. 

3. By the equivalent reduction HAK of 29 (7) prove that 
if A is of rank r, A {k) is of rank 


42* Jacobi’s Theorem on Minors of the Adjugate 

Any minor of order r in the adjugate determinant of a 
square matrix A is, on expansion, a certain polynomial in 
the elements of A , fixed in form whether A is singular or 
not. To find an expression for this polynomial we may 
refer to the nonsingular case. 

G 
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The h ih compound of 

yields ^ * a< ^i A = adj A . A = \A 1 1 # (i) 

(adj A)™ = (adj A) (k) A (k >= \A\ k I. . (2) 

But by taking the product of compound and adjugate 
compound we have also as in 38 (1) b 

A<»adj<*>A = adj (i, ^l jw = |d|/. . _ (3) 

Comparison of (2) and (3), A and so A™ being non- 
singular, gives 6 

(adj .4)® = |4p-i a djw^[. . _ 

Expressing this in words, we have the celebrated 
theorem of Jacobi (1834) concerning minors of the adjugate • 
Any minor of order k in adj A is equal to the com- 
plementary signed minor in A', multiplied by \A |*-i. 

The relation thus established between the respective 
minors is the desired polynomial relation. It persists 
when A is singular, subject to the proviso that if k = 1 
we must interpret as 1. 

An alternative demonstration, the traditional one, 
does not depend on the Binet-Cauchy theorem, but pro 
ceeds as follows : 5 1 


U'j c 4 d 4 

^2 ^2 ^2 
«3 C. 


- '3 ^3 

«4 C 4 d 4 


i Mai Msl • 

• 1-b.I |b 3 | . 

• Msl IC'a! 1 

• Pal Pal • 


a i ■ • c x 

Ml • C s 

a 3 ■ Ml C 3 

a 4 • • C 4 


( 5 ) 


The columns containing the specified minor of the 
jugate, its elements denoted here by capital letters for 
cofactors, are left m place ; the remaining columns are 
deleted and a unit element is placed in each in the rows 
not containing the minor (as in 32, Ex.), so that the 
complementary submatrix of the minor is a unit submatrix 

f; he r r V X 80 °°f tractcd from ad i A is premultiplied 
by A. On taking determinants we derive 1 

M I X (minor of adj A)= | A |* (compl. signed minor in A'), (6) 
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from which Jacobi’s theorem follows. The reader should 
examine the illustrative example, and should form other 
examples with A of order 5 x 5‘. 

1- Construct adj A for a general matrix A of order 3x3 
or 4 X 4 and verify Jacobi’s theorem for various minors. 

2. Construct several matrices of order 3x3 with simple 
numerical elements, and verify Jacobi’s theorem for selected 
minors of the adjugate. Include some cases in which A is 
singular, and note specially the results when A has rank 2. 

Minors of the Reciprocal Matrix. For nonsingular 
matrices Jacobi’s theorem can be stated simply in respect 
of A ~ 1 . For since A -1 = \A | -1 adj A , any minor of order 
h in A" 1 is equal to the corresponding minor in adj A 
multiplied by \A\~ k . Hence by (6) we have another 
formulation : 

Any minor of order h in A~~ l is equal to the comple- 
mentary signed minor in A', multiplied by \A\~- 1 . 

The phrase any minor may he taken to include \A~ X \ 
itself, the complementary minor in A' being then a minor 
of zero order, which by convention we take to be equal 
to unity. 

The formulation just given might have been established 
at once by comparison of the two identities 

(A- = I and = \A\I, . (7) 

which yields the theorem in the form 

(A- 1 )™ = |-4 adj<*>4. . . (8) 

Theorem of Rellavitis- If the cofactor \A i6 \ of 

in \A\ vanishes while the cofaetor of \a iS a h]c | does not, 
then | A | may be factorized into two rational factors, one 
linear in the elements of the i th row of A , the other linear 
in the elements of the j th column of A. 

For there is no loss of generality in bringing a and 
| a i6 a h!c | into leading position. Then b^.^acebi’s ite^em 
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where \A 12 , vz \ is the eofactor of \a n a 2a | in \A\. Hence 
since \A 1X \ = 0 we have 

\A\ = —\A X2 \ \A 21 \/\A 12) i 2 |. . . (10) 

Now \A n \ can be expanded as a linear function of the 
elements of the first row of A, and \A 21 \ as a linear function 
of elements of the first column. The theorem is thus proved. 

3. Deduce that the bordered determinant 


y 


x' 

A 


L where \A\ = 0 


is the product of two linear functions, one linear in the 
elements of the row x\ the other in the elements of the 
column y. 

4. Deduce a sufficient condition that bilinear forms with 
square matrix (31, Ex. 1) or quadratic forms should be 
factorizable into rational linear factors. Construct some 
examples in two or three variables by taking bordered 
determinants with \A\ = 0, and carry out the factorizing. 
For example : 


. x t 

x i ^ 

x 2 2 


x 2 

2 

1 


= xf~4z 1 x 2 ~l-4x2 = (x x —2x 2 )*. 


43. Frank© ’s Theorem on Minors of a 
Compound Determinant 

Jacobi’s theorem can be extended at once to the case 
of any square matrices A and B satisfying a quasi-reciprocal 
relation AB = XI, A =£ 0. By the theorem on the rank 
of a product matrix both A and B must then be non- 
singular, so that B = A A~ L . It follows from Jacobi’s 
theorem that any minor of order Jc in B will be equal to the 
complementary signed minor in A' multiplied by A & [^4 |“ x . 

For example, applying this to the identity 

adj<*4 .AW = \A\ I . . . (I) 

and using Sylvester’s theorem (38) on the value of the 
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adjugate compound determinant we deduce at once 
Franke’s theorem : 

Any minor of order Ti in the k th compound A (k) of a 
matrix A of order nxn is equal to the complementary 
signed minor in the ¥ h adjugate compound transposed, 
multiplied by | A | h-{n-i)^ . 

Sylvester's theorem is itself a special case of Franke’s, 
bearing to it the same relation that Cauchy’s theorem on 
the value of the adjugate determinant (38, Ex. 6) does to 
Jacobi’s. 

44. The Hybrid Compounds of Bazin and Heiss 

Bazin in 1851 considered a compound determinant 
constructed as follows : Let A and B be two matrices each 
of order nxn and let a new matrix be constructed by 
replacing the j th row of A by the i th row of B in all possible 
ways, the n 2 determinants of the resulting matrices being 
taken as ( i , j) th elements of the matrix so derived. This 
matrix is Bazin’s matrix. We shall call it the first hybrid 
compound of A and B taken in that order, and we shall 
denote it by (A ; B). 

Now (A\B) is in fact the same as B adj A. For 
just as each element in A adj A is an expansion, namely, 
by elements of the i th row of A and cofactors of correspond- 
ing elements of the j th row, so, with B substituted for A 
in the first factor, each element in B adj A is an expansion, 
by elements of the i th row of B and cofactors of 
corresponding elements of the j th row of A. But such 
expansions (21 (1)) are those of determinants obtained by 
substituting the i th row of B for the j th row of A ; and so 
Bazin’s matrix is B adj A, as stated. 

Taking determinants we have | (A ; -B) | = \B\ \A\ n “\ 
Bazin’s theorem. 

1. Put B = 2. Then. (A ; I) = adj A, and Bazin’s 
theorem becomes Cauchy’s theorem on the adjugate. 

2. What is the relation of (I ; B) to B ? Write out an 
example of order 3 X 3. 
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Reiss in 1867 (and Picquet in 1878) considered from the 
standpoint of determinants a more general matrix obtained 
from A and B in a manner resembling Bazin’s, except that 
not one but k rows of B were substituted for k rows of A 
in all possible ways, the determinants of the resulting 
matrices being taken as elements of a matrix, arranged in 
rows and columns according to lexical order of the sub- 
stituted rows. It is easy to see that this Reissian matrix, 
of order %x%), is the same as R (fc) adj (fc h4. For just 
as the elements of A {h) &&] {lc) A areLaplaeian expansions (33) 
by minors from k rows of A and cofaetors of corresponding 
minors from either the same or a different set of k rows 
of i, so the elements in B m &d] {k) A are Laplacian 
expansions in which the first set of minors from k rows of 
A just mentioned is replaced by the corresponding set in B. 
But these are all Laplacian expansions of determinants 
obtained by substituting k rows of B for k rows of A ; and 
the principle of lexical order is observed. Hence the 
matrix of Reiss is B (A:) adj (;fc h4, as stated. 

Taking determinants we have a theorem of Reiss : 

The determinant of Reiss is equal to 

# . ( 2 ) 

3. Bazin’s is the case k — 1. The case gives 

Sylvester’s theorem on the k th ad jugate compound 
determinant. 

4. What is the relation of the k th Reissian of / and B to B ? 

5. Show that the coefficient of X n ~ k {z k in |A.4 ~b fj.B\ is the 
trace of B {Jc) adj {k) A, the k th Reissian hybrid. 

From A and B another Bazin matrix arises, A ad? B. 
Now 

A adj B . B adj A = \AB\I, . . (3) 

and so the two hybrid compounds are quasi -recix>rocal. 
It follows from Jacobi’s theorem that any minor of order 
k in A adj B is equal to the complementary signed minor 
in (B adj A)\ multiplied by a certain factor, which will 
evidently be a product of powers of \A\ and \B\. We 
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leave the reader to prove, by a consideration of dimensions, 
or simply by putting A = I, B = I in turn, that these 
powers are respectively the (h-~~ l) tn and the (h—n- 

In exactly the same way, since 

4<«adj ( w£ . J3<»adj«>4 = \AB\I, . (4) 

*we see that any minor of order h in the h th Reissian hybrid 
is equal to the complementary signed minor in the 
transposed k th dual Reissian, multiplied by certain powers 
of \A\ and |JS|. Once again these powers can be found 
easily, either by consideration of dimensions or by putting 
A = /, B = I in turn and referring to the special case, 
namely, Franke’s theorem ; and so we find the multiplying 
factor to be ^ 

This last theorem, which is also due to Reiss, is the most 
general of the theorems in this class ; the other theorems 
of Reiss himself, and those of Bazin, Franke, Sylvester, 
Jacobi and Cauchy, are successive special cases of it. 

6. Since (B adj A)W = 

the k th Reissian hybrid matrix is simply, apart from a scalar 
factor, the k th compound of Bazin’s matrix ; which suggests 
alternative derivations for the theorems proved above. 

45. Complementary Identities : Extensional 
Identities 

The reader will by now have realised that the appearance 
of generality of several of the above theorems is deceptive, 
and that the really useful theorems are the two on which 
all the others rest, the Binet- Cauchy theorem and Jacobi’s 
theorem. We shall now see how Jacobi’s theorem furnishes 
the means of deriving valuable dual identities and extended 
identities in. determinants. 

Let determinants and their minors be indicated by 
diagonal elements in a single-suffix notation, and let 
cofactors be denoted by corresponding Greek letters ; so 
that for example the transposed adjugate determinant of 
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|a 1 6 2 c 3 d 4 ] will be |a 1 jS 2 y 3 S 4 |. Now consider any homo- 
geneous polynomial identity involving minors of a 
determinant, of any orders. Then such an identity, being 
general, is true also for the transposed adjugate, and so 
may be written with Greek letters. Next, by Jacobi’s 
theorem we may replace every minor of the adjugate by 
its complementary signed minor in A\ multiplied by some 
power of |-4 1 . In such replacements \A\ itself may be 
regarded as the complementary minor of a minor of order 
zero, the latter being taken by convention to be equal 
to 1. This substitution of minors will thus give us another 
identity, which may or may not be different in form from 
the first identity. In any case the original identity and 
this derived one stand in a dual relation, and may be called 
dual or complementary identities with respect to A. 

1. Consider the identity ]a 1 6 2 c 8 | == iK||N>||c 3 ||, where the 
notation on the right indicates that we are to view the 
elements as minors of the first order. Now the complementary 
signed minor of a 19 for example, is |6 2 c 3 |, and so on, and 
applying Jacobi’s theorem we derive the dual identity 

\ a i h i c *V = |adj A\, 

an example of Cauchy’s thoorem on the adjugate. The reader 
will easily extend this to the case of general order. 

2, Consider a determinant \A\ — \a x b z c z d A \ ; partition it 
according to its 1st row, its 2nd row and its last two rows, 
and then expand it as a general Laplacian expansion 

— I a ill^2ll c 3d 4 | — +■•■ 

of 12 terms. Taking complementary minors, and making 
some sign changes in rows and columns, we obtain 

l« A 1 2 = WalK^AI IV A! - KcssIM All V Al +* • •. 

an identity far from obvious. 

But the consequences of Jacobi’s theorem are wider 
still. Any minor such as |a a 6 2 1 is not merely a minor of 
it is also a minor of |a x 6 2 c 3 ri 4 c 5 / 6 f/ 7 |, where 
( e sfdh)> which we shall term an extension of 
denotes the presence of added rows and columns. Now, 
given any identity, we may as before write down its 
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complementary. We can then, take the complementary 
of this, not with respect to the original determinant, which 
would merely give us back our first identity, but with 
respect to the enlarged or extended determinant such as 
|a 1 6 2 c 3 cZ 4 e 5 / s g 7 |. When this is done, every minor in the 
original identity, including minors of order zero and the 
determinant itself, reappears with a tail or extension added 
to it. For example the complementary of 


l«A! = IK I K I I is K*»I = 


K 

— K| 

-K 

. jaj 


(1) 


a very trivial result, but the complementary of this with 
respect to \a x b 2 c z d^\, for example, yields 5 *- 

[ a i C 3^4 [ | 


|&i& 2 c 3 ^ 4 | m 4 | 


L '3 U/ 4 1 
^2^3^ 4 ( 


'4 1 
1^203^4 1 


(2) 


which is by no means trivial and is indeed one of the 
identities on which (21 (6)) the method of evaluating 
determinants by pivotal condensation was based. That 
the identity is extensional in nature is seen by the extension 
(c 3 <# 4 ) which is apparent in every indicated minor. The 
reader will generalize the result. 

The rule so derived for the existence or the construction 
of extensional identities may be expressed as follows : 

If any homogeneous polynomial identity involving the 
minors of a determinant be written in terms of the diagonal 
elements of the minors concerned, then a new identity can 
be obtained by extending or elongating all such diagonals 
by further elements, such as (e 5 f 6 g 7 ), provided that 
homogeneity be maintained by the insertion of powers 
of the determinant of the extension, for example \e 5 f 6 g 7 \. 
Such a determinant is to be regarded as the extension of a 
minor of order zero. 


3. The extensional of 

K&aCsl = KII 6 » c »I-KII 6 i c *i+KII & i°«l 

by the extension (d 4 e 5 ) is 

203^40 511^6 5 1 = i%d 4 e 5 ||& 2 c 3 d 4 e 6 | | | 3^4® 5 1 

-f |«- 3 d 4 0 5 | |6iC 2 f7 4 c 5 | . 
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4 . The extensional of Sylvester’s theorem 

|| (X 1^2lI Cl l C 3[| a i^4||^2 C 3||^2^4il C 3^4|l = I a l^2 C 3^4| 3 > 
where we indicate on the left the 2 nd compound of i% 6 2 c 3 r ^ 4 |> 
by the extension (e 5 / 6 ) yields 

|| a X^2 e 5/6|| a l C 3 e 5/6|***I C 3^4 e 5/6il — \ a i^2 C Z^ e 5f S I e 5^6 f 3 » 
a theorem which, stated for general orders, is often called 
Sylvester’s theorem. It is in fact, as we see, a simple oxten- 
sional of Sylvester’s theorem on the Jc th compound determinant. 
This extensional theorem has a deceptive resemblance to 
Reiss’ theorem on a hybrid compound (44 (2)), which is not an 
extensional theorem. 

5 . A theorem of D’Ovidio (1877) is to tho effect that a 
compound determinant having for elements all minors of 
order k not entirely contained in tho last h rows and A columns 
of A, where A is of order nxn, has tlie value 

where A x denotes the submatrix composod of tho first n—A 
rows and n—A columns of A. 

The matrix of D’Ovidio’s compound is a submatrix of 
A (k \ of order n (k) —h {k} . The complementary submatrix in 
the transposed k th adjugate compound of A is easily perceived 
to be the extensional of the transposed k ih adjugate com- 
pound of A where A 2 is complementary to A x in A , by the 
extension (A'). Thus the theorem is really an extensional 
of Franke’s theorem (43) in a rather disguised enunciation, 
and the exponents are easily found. 

Let us take the opportunity of remarking that the 
extensional nature of an identity may sometimes be 
concealed by a permutation of rows or columns in a 
minor, deranging its diagonal representation. For example, 
if K&2 c 3^4l be written — |a 1 c 2 6 3 ri 4 |, the presence of the 
extension (c 3 <i 4 ) may escape notice. 

Double or Multiple Extensionals. Suppose that 
we had a term such as \a x b 2 1 \cjd z \ in some deterramantal 
identity. We might take complementaries with respect to 
|ai& 2 c 3 d 4 l, let us say, and so derive a complementary 
identity. But then, on returning, we might take the 
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complementary of one factor in a term of the com- 
plementary identity with respect to |a 1 6 2 c 3 ^ 4 e 5 |, let us say, 
and of the other factor with respect to \d'^b^c z d 4 j^g 1 1. 
Provided that all terms are similar, and the procedure is 
carried out uniformly over all, it is quite legitimate, in 
virtue of Jacobi’s theorem* Thus one set of minors 
receives one extension, such as (e 5 ) above, while the other 
set receives another extension, such as ( f G g 7 ) above. We 
may call the identity so derived a double extensional 
identity, and the reader will at once see how, in homo- 
geneous identities composed of similar terms consisting of 
three or more factors each, triple and multiple extensional 
identities may be derived. A few examples of classical 
identities belonging to this type will serve as indications. 

6. The identity 

M»e*||c 1 d,/ 4 |— |a 1 c # e*||& 1 d,/ 4 |+|cs 1 tf # e s ||6 1 c a / ( ,| = 0 

is a double extensional, by means of (e 3 ) and (/ 4 ), of the simple 
identity (Mongo’s identity of 1809) 

— |°h c 2 11^1^2 1 +l%d 2 ||6 1 c 2 | = 0. 

The simple identity may be proved by Laplacian expansion 
of a determinant of the 4th order with repeated rows. 

7. What is the simplest identity of which 

|«iC 4 d 5 | |a 2 6 3 e 6 ] — |a 2 c 4 c? 5 | |aj& 3 e 6 | -+■ |a 3 c 4 a! 6 | |a]& 2 e 6 | = 0 

is an extensional ? Prove the simple identity (Desnanot’s 
of 1819) in the manner of Ex. 6. 

46. Schweinsian Expansions of Determinant 
Quotients 

Quotients of determinants of the type |A 1 6 2 c 3^4l/ \ a i^2 c z^n\ 
and |%6 2 c 3^4l/l^2 c 3^4l are common in applications of 
determinants, the first of them occurring (22 ( 4 )) in the 
solution of simultaneous equations. In the first quotient 
the numerator differs from the denominator in the first 
column only. The second quotient is of the form \A |/|A U |. 
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In the general ease the determinants in numerator and 
denominator are of the n th or the n ih and (n— 1)**, order. 

Such quotients may be expanded in series of a type 
first given by Schweins in 1825. The procedure will be 
sufficiently illustrated by quotients with numerators of 
the 4th order. 


(i) Let — q&, \^i^ 2 c z \ I \ a i^ 2 . c, d\ ~~ ( h- 


Then q^q z = 


2^3^ 4 1 |^1^2 C 3^ 

|Aj&2 C 3 


CCjb^CQ j 


/|u 1 6 2 c 3 | \tt^2 c, /h I • (1) 


The numerator of (1) is clearly an extonsional, by the 
extension (b 2 c z ), of 




and so is equal to 

|&jft 4 &2 C 3| |^1^2^3 1 = I 2^3^ 4 1 
by an even number of interchanges of columns. Hence 

#4 — #3 ^ |^1^2^3 C 4 I \^l C 2^z\l\ a l^2 C s\ |%^2 C 3^4 I * • (^J 

In the same way 

#3~"#2 ” W a ^z\ |^l C 2|/W ; 2l \ a lb2 C z\y * (3) 
#2~#1 = |^l a 2l^l/ a ll a l^2l> • • * (4) 

q 1 = h x la v , . . . * (5) 


Adding now the results (2), (3), (4), (5) we have the first 
Schweinsian expansion 

[ h-^b^C^d^ | / |# A^3^4 1 ^ ^1 / 4* |^x^2 1 ^1 fal |%&2 I 
+ |^iU 2 6 3 i l&x^I/W^I 

4* l^i a 2^3 c 4l | b^c^d z | / |%&2 C 3 | |^ib 2 cy/ 4 |, . (0) 


The general theorem of this kind, for which an exactly 
similar proof may be constructed, follows the above pattern. 

(ii) Let KVai/lVal = Jh- 

1^1^ 2^3^ 4 1 |^2 C 3^4 

\ a \bcf z \ \b z c Q \ 


Then Pi~-Jp z ■ 


l\b 2 c z \ | h z c z d 


4l- 


(?) 
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Here again the numerator is an extensional, by (& 2 c 3 ), of 

= 

and so is equal to 

|^l^2 C 3| = | a 2^3 C 4i |^l C 2^3l? 

again by an even number of interchanges of columns. 
Hence = — |aA c «l l&i^l/IVal • (8) 

and by adding this to" the similar results for p 3 —p 2 , P 2 ~~Pi> 
and p x = a Xi we derive the second Schweinsian expansion 

|%^2 C 3^4|/|^2 C 3^4l ^ a l — a 2^ll^2 — \ a ^z\ |^l C 2|/^2 1 

|u 2 ^3 C 4l l^i C 2^3 1 / |^2 C 3 | |^2 C 3^4 1 ' * * (®) 

The form of the general result, which is proved in the 
same way, is clearly seen. The Schweinsian expansions 
(6) and ( 9 ) have what may be called the “ extra shutter ” 
property, namely, that if a term be added similar to the 
last term but with every minor suitably extended, the 
sum of the series then yields the next higher quotient in 
the sequence. We have assumed throughout that all 
minors in the denominators are fxon- vanishing. 

X. If 3^4! ~~ 9 , | & 2^ 3^^4: [ t*” then 

a x = (^2^ll^2^\ a 2^z\\^l G 2\l^z\^2 C z\~^r\ 0 'Z^Z C 4\\^l C 2^z\/\^2 C d\\(^2 G Z < ^4)> (1^) 
and so in general. 

2 . If u x is a cubic polynomial % = c o +0i#+c 2 # 2 +c 3 # 3 , 
and if 

j\t x dx = m 0 , j\cu x dx = m t , j\c 2 u x dx = m 2 , j\s 3 u x dx = m 3 , 

we may write down corresponding to these five conditions 
five homogeneous linear equations in 1 , c 0 , c x , c 2 , c 3 . The 
condition of consistency is that the determinant of the 
system, namely, 

u x 1 x x 2 x z 

™o 1 i i i 

m i i f J -} ” 0. 

”h i i i i 

**h i i i t 


d/4 

a x 1 



no 
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If the reader now regards this determinant as j«i& 2 C 3 <£ 4 e 6 | 
and applies (10), u x will be obtained in terms of polynomials 



1 

X 


1 

X 

X 2 


1 , 

1 

i 

9 

1 

i 

i 

, 





i 

i 

i 



1 x x 2 a? 3 

1 i i l 
i i i i 


i i 


i 


If these polynomials be denoted by p 0 (x ), Pi(x), Pz(x) 9 p%(%), 
the reader will easily prove that they satisfy the condition 


J* x s p r (x)dx ■ 


0 , 8<r . 


3. Let the polynomial f(x) = c 0 +c x x+c % x* bo given by 
three values j{a 0 ), /(cq), f(a 2 ), whore <7 0 , a X9 a 2 arc distinct. 
Then the consistency of the four equations for f{x), f(a 0 }, 
/(%)>/(# 2 ) * n terms of 1, c 0f c lf c 2 is expressed by the vanishing 
of a determinant of the 4th order. Write it down and, by 
applying (10) to it, obtain an expansion for f(x) in terms of 
f(a 0 ), /(%), /(a 2 ). (The expansion is Newton’s interpolation 
formula of divided differences.) 

4. Expand the quadratic form 


* 

% 

x z 

*3 


(hi 

a 12 

a i 3 

*2 

a 2l 

^22 

a 23 

x z 

a Zl 

a 32 

a zz 


a ll 

a i2 

a 2I 

(l 22 

a zi 

Cl Z2 


^23 

a 3S 


where the bordered matrix A in the numerator is symmetric, 
by the second Schweinsian expansion. Note that the result 
gives the quadratic form in terms of squares of certain linear 
forms in x l} x 2 , x z , 

5. Note that sp.d. <ft) = sp k A. (See 37, (3).) 

6. The Binet -Cauchy theorem applied to Q~ l AQ ~~ A 

(37, Ex. 7) gives ~ A {k) . 

This is clearly a reduction of A lk) to canonical form A lk} , 
a diagonal matrix (38, Ex. 2) with &-ary products of the latent 
roots of A in its diagonal. These A; -ary products are therefore 
the latent roots of A ik K The factor Q tk) also shows that 
the latent vectors of A m are k th compounds of the n ik) 
matrices formed by juxtaposing k latent column vectors of A. 



OHAP Tlfi&rV-I- 

SPECIAL DETERMINANTS: ALTERNANT, PERSYM- 
METRIC, BIGRADIENT AND CENTROSYMMETRIC, 
JACOBIANS, HESSIANS, WRONSKIANS 

In the present chapter we survey a number of special 
types of determinant which occur persistently in the 
solution of certain general mathematical problems. 

47. Alternant Matrices and Determinants 

A symmetric function is a function of several variables 
which remains unaltered when any two of the variables 
are interchanged. An alternating function is one which is 
merely altered in sign by such an interchange. It is evident 
at once that the reciprocal of a symmetric function, the 
product of any number of symmetric functions and the 
product of any even number of alternating functions are 
symmetric functions ; while the reciprocal of an alternating 
function, the product of a symmetric and an alternating 
function and the product of any odd number of alternating 
functions are alternating functions. Constants, being 
independent of the variables, are to be regarded as 
symmetric functions of zero degree. 

1. Examples of symmetric functions : x x +x 2 , #i+# 2 +# 3 > 

x x x 2 +x 2 x 3 -i-x 3 x l9 x x x^c 3> sin(ic 1 +^ 2 )» cos^+a^b 

cosh^ +x 2 ), (x x —x 2 ) z + (x 2 —x z ) 2 -f- (# 3 — x x ) 2 . 

2. Alternating functions : x x —x 2i (x 1 —x 2 )(x 2 —x 3 )(x 3 —x 1 ) f 
(a?i— # 2 ) 3 > sinfo— x 2 ), sinh(x x —x 2 ), {x x + x 2 )/(x x -~: x 2 ). 

A function f(x) is often expressed linearly in terms of a 
in 
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linearly independent set of basic functions p 0 (z), p x (x) 3 
p n -i( x ) as follows : 


f(x) = c 0 'p Q (x)+c 1 p 1 (x)+ ... +c n ^ l p n _ 1 {x). . (1) 


Tor example the functions pj(x) may be powers x j 3 or may 
be trigonometric fimctions cos jx or sin jx. Each special 
problem has its own appropriate basic functions Pj(x). 

When x takes the m values x v x 2 , x m let the matrix 
of order m'Xn which has for its ( i , j) th element be 

denoted by P. Then the set of m values/^) may be regarded 
as a column vector and in view of (1) may be written 

/-Pc, . . ( 2 ) 

where c is the column vector {c 0 c x ... c^}. 

Such a matrix P — [p^-ifa)] of functional values is 
called an alternant matrix , and its determinant |P|, when P 
is square, is called an alternant. It is so called because to 
interchange two of the variables, such as x t and x k3 is to 
interchange the i th and k th rows of |P|, and therefore 
to change its sign. Hence [P| is an alternating function 


m x jr 2 , • 

3. \x° *2 X% I 



1 Xi x\ 


i 4 4 


1 xo o;| 

and j#i x\ ct 3 J ss 

1 xl x\ 


1 #3 x l 


1 xl 


are alternants, die basic functions pj(x) being the powers 
1, x, x 2 and 1, xQ 3 x r respectively. 

4. Write out in full the alternants briefly denoted, after 
the manner of Ex. 3, by \xi cos x 2 sin x z \, |;r? cot x 2 rrjj|. 


Polynomial Alternants. The Simple Alternant. 

The alternant \x\ x \ x \ *•- a#" 1 ! is the product (19, 
Exu 11) of the \n(n— 1) differences of x l3 x 2) ..., x n taken 
in pairs and written in reversed order. We shall denote 
it by |4(012...?i— 1)|, leaving the arguments x t to be 
understood, and we shall call it the simple alternant. 

5. Prove by operations on columns that if instead of the 
powers 1, x, x 2 , x n ~ x we take as basic fimctions polynomials 
of degree 0, 1, 2 , n— 1 having 1, x 9 x 2 , x n “~ l respectively 
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as terms of highest degree, then the resulting alternant is 
equal to the simple alternant |4(012.;.w— 1) |. 

Try some numerical examples of low order, taking poly- 
nomials such as 1, x, x(x—l) 9 x(x—l)(x—2) for basic functions. 

6. Prove that if the basic functions are polynomials of 
degree, 0, 1, 2, ..., w— 1 with respective terms of highest degree 
d 0 , d x x, d 2 x 2 , ..., d n ~ x x n ~ l , then the alternant is equal to 
d o d 1 d 2 ...d n _i|J,(012...n— 1)(. 

The more general polynomial alternant based on n 
powers 1, x r , ..., x t , 0<q<r< mi .<t, may be denoted 
by \A{§qr...t)\. There is no loss of generality in taking 
the first index to be 0, since if it were p we could cancel 
x% from the respective rows of the alternant. 

The alternants \A(0qr...t)\ and |^l(012...w— 1)| are 
easily seen to be homogeneous alternating polynomials 
in x l3 x z , x n . Further, \A(0qr...t)\ vanishes when we 

put any x € = x k , because the i th and k th rows then become 
identical. Hence by the remainder theorem it must 
contain the difference-product \A(0l2...n— 1)| as a factor, 
so that the quotient of alternants 

| A (0 qr. ..t)\/\A(012.. .n^l) | (3) 

must be a homogeneous symmetric polynomial. It is 
called a bialternant , and is fundamental in the theory of 
symmetric polynomials, in algebraic invariants and in 
more general domains of algebra. Determinantal expres- 
sions for it will be given in 49. 

7. Evaluate and examine the bialternants \A (024) / 1 A (012) | , 
].4(014)|/p4(012) | and ]^4(023)|/|^(012)[ in x x , x*. 

48. Elementary and Complete Homogeneous 
Symmetric Functions 

The sum of the n (j) products of x v x 2 , x n taken j 
at a time, without repetition of any in a product, is 
called the elementary symmetric function of degree j in 
the arguments x l3 x 2 , x n , and is often denoted by a,-, 

H 
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the arguments being tacit. We shall use % in this sense 
as far as 50. The sum of the products of the 

x v x 2 , ..., x n , taken j at a time and with unrestricted 
repetition of any in a product (so that powers up to 
the j th may appear), is called the complete homogeneous 
symmetric function of degree j , and will be denoted by h^ 
It is sometimes called the aleph function of degree j, a 
name used by Wronski in 1812. We note that %=0, j>n. 

1. The following are elementary symmetric functions in 
the arguments concerned : x^x^+x^ x^+x^+x^ 

2. The following are the corresponding hji x 1 +x 2 +x i , 
xl+xl+xl+X&z+XjXs+XzX^ Sx\ + HXiXz+XjX&t. 

It is easily proved that a 5 is the coefficient of ( — )H* 
in the expanded form of the generating function 

(1— x 2 t) ... (l—x n t), . . (1) 

while hj is the coefficient of t j in the expansion of 

(l+x l t+3#*+...)(l +x z t+x$*+-)-( 1 +x n t+x%*+...), (2) 

the terms in t 2 , t 3 , ... in the factors of (2) taking account 
of the repetitions of the Now the generating function 
(2) is visibly the reciprocal of (1), and its expansion is a 
series which converges if i~ x is less in absolute value than 
any of x v x 2 , x n . Consequently by multiplication of 
(1) and (2) 

1— (1 — +( — } n d n t n )(l ...), (3) 

from which by comparing coefficients of powers of t we have 

a k\ ~ a k- 1^1 + + ( — ' ) k Cl 0 h k = 0 , 

where by convention a 0 = h 0 = 1. These relations, which 
are evidently unaltered when the roles of the a i and h^ 
are interchanged, are called Wronskfs relations. Let us 
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observe that they are .expressed completely in matrix 
notation by the statement that _ 

°° z° * 

~ a i a ° Al n ° 

A= a 2 — a x a 0 "and H= h 2 \ K ( 5 ) 


are reciprocal matrices, as the reader should verify by 
proving that AH = HA = I. Indeed since the deter- 
minants \A\ and \H\ are both equal to 1, the matrices A 
and H are mutually adjugate as well as reciprocal. 


49. Bialternant Symmetric Functions of Jacobi 

Let us take three arguments x v x 2 , x 3 , or, for greater 
ease in printing, a, p, y and let us consider h r {a, p,_ y)- 
The terms in this symmetric function may be divided, into 
two groups, those containing a and those not containing a. 
The latter set clearly gives h T {p, y ) ; and slight considera- 
tion will show that the former set gives ah^a, p, y). 

Hence h r (a, j8, y) = Mfr y)+ah r - 1 (a, /3, y) _ 

= h r ((t, y) +^T-i( a > y) 

whence it follows, on subtraction, that the divided 
difference^ ^ /„ «A\HR— a )=h r - 1 {a, fi, y)- ■ ( 2 ) 

A corresponding identity can be proved in the same way 
for any number of arguments. Now let us take the 

alternant 1 a r a s a t 

|A(CM| = \ yr yS yt • 

1 S r 8 s S* 

and subject it to the following operations: 

and remove the factor 0-cl ; vow.-row, and remove 

y—a; row 4 — row! and remove 8— a ; then row 3 row* 
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and remove y— j8, row 4 — row 2 and remove S—/3; and so 
on. By repeated use of (2), with the evident fact at the 
first stage of h r { a)=a r and so on, we thus arrive at 

1 h r ( a) & 8 (a) A^(a) 

• V-iK P) h-i(a-, P) K-\ (a, P) 

■ K-i(. a > P> y) K-zio-, P, y) h- s(«> P> y) 

• Vst®. P> y, S) P> y, S) 7i{_ 3 ( a, p, y, 8 ) 

Next let us perform row 1 +j3row 2 , row 2 +yrow a , 

row 3 +8row 4 ; then row 1 +yrow 2 , row 2 +Srow 3 ; then 
row x +8row a , using (1) each time. The factors previously 
removed, /3— a, y—a, S— a, form the difference-product 
[4(0123)1, so that writing hj{ a, /?, y, 8) = hj we have now 

7i 0 7i f ft* ft$ 

\A(0rst)\/\A(0m)\ = ‘ J" 1 , . (5) 

• "'r — 2 10 t — 2 

* ^r~3 ^a-3 ^4-3 

which, if the order of rows and columns he reversed, is 
seen to be a minor belonging to consecutive rows of \H\ 
in 48. Let us observe that when the first row is given, 
or the diagonal suffixes 0, r—l, s—2 t *—3, the 
bialtemant (5) is completely determined ; also that 
the sum of the diagonal suffixes (r — 1) -+-(^ — 2) -f- — 3), 
where 0<r— 1 — 2<£— 3, is the degree of the bialtemant. 
The diagonal suffixes thus constitute a partition of the degree. 

1. The partitions of the integer 4, arranged in ascending 
order, are 1 + 1 + 1 + 1, 1 + 1+2, 1 + 3, 2+2 and 4. There 
are therefore five bialtornants of the 4th degree, which the 
reader should write out in full. The last two, for example, are 



1 7t 0 h 4 

t * ft ^ ft$ 

I • K h 2 




and since h 0 — 1 the first row and column in these may be 
deleted. 

2. [4(012*} |/|4(0123) | - V 3 , |4 (01 234) 1/(4(01234) = ft,_ 4 . 
and so on. 
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The determinantal form (5) for a bialternant is due to 
Jacobi (1841) arid can be derived for a general bialternant 
by the method we have exemplified. We may notice two 
features in the bialternant : (i) the suffixes of the in 
its first row are the indices of the alternant in the numerator 
of the quotient of alternants ; (ii) the sum of the suffixes 
in every term of the expansion of a bialternant in its 
determinantal form is equal to the degree, for example, 
(r— l)+(<$— 2)+(2 — 3) in (5). The sum of the suffixes is 
often called the weight , and a polynomial in which each 
term is of the same weight is said to be isobaric. 

Dual Form of a Bialternant. By the aid of Jacobi’s 
theorem on minors of the adjugate we may obtain an 
alternative form for the bialternant as an isobaric deter- 
minant like (5), but with elements % instead of h^ For 
suppose the matrices A and H in 48 (5) to be of order 
(t+-l)x(t+l). Then the lower left corner element of H 
is h t , and the upper right corner element of A' is (— Ya t . 
Let rows be named by the highest suffixes of the h d or the 
dj which they contain. The rows of H are therefore 
named by 0, 1, 2, t and those of A' by the reversed 
order t , £-4*1, t— 2, 2, 1, 0. Now the bialternant (5) 

for example, when transposed, belongs to rows 0, r , s, t of 
H y columns 0, 1, 2, 3. Hence, by Jacobi’s theorem, the 
complementary bialternant in A' belongs to rows obtained 
by removing 0, r, s, t from the sequence 0, 1, 2, t and 
taking the complements with respect to t (because of the 
reversed order of rows in A') of the numbers that remain. 

For example, if 0, r, s , t were 0, 1, 3, 6 the numbers not 
contained are 2, 4, 5 and their complements with respect 
to 6 are, in ascending order, 1, 2, 4. Hence we have the 


identity 


K h'l hr 6 JIq 

• K h h 

. . K \ 


a x a a 
cc Q d x d 3 , 

. C&Q $2 


• ( 6 ) 


where we have made all elements of the determinant on the 
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right positive by altering the sign of the first row and 
second column in the corresponding signed minor of A'. 

The proof which we have here sketched in outline from 
an example is general, and establishes a dual or com- 
plementary relationship between bialternants expressed in 
elements h j or in elements Sets of integers like 0, 1,3,6 
and 1, 2, 4 are said to be bicomplementary with respect to 
the highest index, such as 6 here, and the theorem of duality 
between bialternants may be succinctly expressed thus : 
the suffixes of elements in the first rows of dual bialternants 
form bicomplementary sets. 


3. Find the sets bicomplementary to (0125), (0235), (025) 
and (01245) with respect to 5. Write down the corresponding 
identities between dual bialternants. Interchange h with a 
and note that all the identities remain true, because of the 
symmetry of Wronski’s relations, 48 (4) and (5). 

4. Prove that h r is equal to a bi alternant of order r with 
all diagonal elements equal to a v Hero again h and a may 
be interchanged. 


Reciprocal of Alternant Matrix. 


A (012) is 


Pv 


l 
l 

l 1 y 


a a * 

P P 
y\ 

ya 


(a—/?) (a -~y) (fi -y) (0 -a) 


ft+y __ y+a 

(a /})(a y) (j8-y)(/ 3-a) 

1 1 


The reciprocal of 


aj 3 

(y— a)(y— j3) 

a+P 

(y-a)(y-f}) 

1 



_ (a—P)(a—y) (r - yN j~*) iy-a}(y-/J)J 

The numerators in the elements of the respective columns 
are elementary symmetric functions in the arguments 
a, j8, y with one argument omitted each time. The nature 
of the polynomials in the denominators of the elements is 
clear from the example. The reader will be able to extend 
the result to the case of a general alternant of order nxn. 
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50. Confluent or Differentiated Alternants 

A polynomial f(x) of the n th degree is uniquely deter- 
mined by the values which it assumes for n+1 different 
values of x . When two or more of these points coincide 
the polynomial is no longer determinate ; but if we are 
provided with the values of a sufficient number of deriva- 
tives of f(x) at the points of coincidence f(x) again becomes 
determinate. For example a polynomial of the 4th degree 
is full y determined by the values of /(a), /'(a), /"( a), /(/?), 
/'(/?), provided that a and /3 are different. The condition 
of consistency which underlies the five equations involved 
is expressed by an alternant matrix in which the elements 
of certain rows have been differentiated. We shall call 
such a matrix a confluent alternant matrix, and its deter- 
minant a confluent alternant. 

For example, if f(x) = c 0 +c 1 x-j-c 2 x 2 -i~c 3 x 3 +c 4 x 4c . (1) 

then the expressions for /(a), /'(a), /"( a)/2!, /(/ 3), /'(/ 3) give 
five equations in the coefficients c 5 with matrix 

1 a a 2 a 3 a 4 

. 1 2a 3a 2 4a 3 

. . 1 3a 6a 2 , . . . (2) 

1 j3 jS 3 /3 4 

. 1 2/3 3/3 2 4/3 3 


an example of a confluent alternant matrix. The operations 
on rows are of the type (djdx) k llc\. 

1. Construct the confluent alternant matrices corresponding 
to /(a), /'(a), m, f(P) and to /(a), /'(a), f(a)/2!, /'"(a)/3!, 
where fix) is a cubic polynomial. 

Evaluation of Confluent Alternant Determinant. 

Let us consider , „ « d 

1 a a 2 a 3 a 4 

1 p /3 2 /3 3 /3 4 

| A (01234) | = 1 y y 2 y 3 / 

1 8 S 2 S 3 S 4 

1 e e 2 € 3 e 4 

V 



(3) 
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in ■which a, y are going to coalesce to the same value 
a, and 8 and e to the same value 8. Let us apply to rows 
1, 2, 3 and to rows 4, 5, separately, the first set of operations 
used on the alternant of 49 (3). We thus obtain 

|.4(01234) | /{J (a, f3, y)A(8, «)} 

1 a a 2 a 3 a* 

. 1 A 1 (a, jS) h z (a, /3) h 3 (a, ft) 

= ■ ■ 1 P> y) • • ( 4 ) 

1 8 8 2 8 3 8* 

. 1 ^(8, e) h 3 ( 8 , e ) k 3 (8, <=) 

ISTowsince by 48 (2) thefunction{(l~~ a£)(l— fit) (l 
generates the symmetric function k/a, ft, y) as coeflieient 
of t j in its expansion, the generating function when 
a=/3=y is (l—a£)” 3 , -which can he expanded by the 
binomial theorem. In the same way when S ~ e the 
generating function of hj(8, e) is (1 — 8*) ~ 3 . Kxpanding 
these generating functions we see that when a, ft , y 
coalesce to a and S, € coalesce to 8 the deterim inant on 
the right of (4) becomes 

1 a a 2 a 3 a 4 

1 2a 3a 2 4a 3 

. . 1 3a 6a 2 * * (5) 

1 8 S 2 S 3 S 4 

. I 28 38 2 4§ 3 

But this is the confluent alternant. Hence its value is 

lim d(a, ft, y, 8, €)/{A(a, ft, y)A(8 } e)} 

= lim {e —y) {€ -— ft) {e — a) (8 — -y) {8 — ft) (8 — a) = {5 a)«. . (6) 

ft, y->a, c->8 v 

<# 

This function may be called the confluent dijfln-nca- 
product . It may .be denoted by 2l{(aaa)(S8)}, being 
regarded as constructed from differences of arguments taken 
one from each set (a, a, a) and (8, 8) in reversed order, the 
zero differences of elements from the same, set bring 
excluded. 
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Following the above lines the reader will show .that the 
value of the general confluent alternant in which elements 
coalesce to a v n 2 to a 2 , and finally n k to a k is 

ax,..., a.i) {a 2 , a^)} == II ( ctjj. — a^) n ^ n k f i<C.k. (7) 

Reciprocal of Confluent Alternant Matrix. The 

problem of finding the reciprocal of a confluent alternant 
matrix may be approached by interpreting A^P* 1 as 
operations on columns of A~\ where PA is the result of 
elementary operations (19, Ex. 21) on the rows of A . 

In view of 21, Ex. 7 and 9, it is seen that if P in pre- 
multiplication effects operations row*— Arow 7 „ p, TOW h , then 
P~ x in postmultiplication effects col fc +Acol*, p, _1 col A . 
Now a confluent alternant is derived (50 (3), (4), (5)) 
from a ample alternant A by operations PA like 
row x , (j8— a ,~row 2 y— i S)~ 1 row 3 and 

so on. Hence the reciprocal confluent alternant A^P- 1 
may be determined from A~ x (for this see 49 (7)) by 
operations like coli+colg, (/?— a)col 2 , colg+colg, (y — yS) col 3 
and so on, and then permitting coalescence of appropriate 
arguments. 

It will be found instructive to prove, both by this 
method and by direct formation of the reciprocal, that 

y(y — 2a) ya a 2 

(a-y) 2 SL—y (y— a) 2 

2a y- f-a 2a /q% 

(a— y) 2 a— y- (y— a) 2 

1 1 1 

(a— y) 2 a— y (y— a) 2 

51. Persymmetric, Circulant and Centro- 
symmetric Types 

A determinant in which all elements in any diagonal 
at right angles to the principal diagonal are the same was 
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formerly called orthosymmetric. Adopting a more modem 
name we shall call both the determinant and its matrix 
persymmetric. For example the matrix 


«1 

a 2 

®3 


«2 

«3 

«4 


®3 

«4 

«5 

®< 

a 4 

“5 

«6 

a. 


• (i) 


is persymmetric. Evidently persymmetry of a matrix 
A s= [a#] is completely characterized by the statement 
[%] = [d-i Also the number of indcj indent elements 

in a persymmetric matrix of order nxn is 2n— 1. 

Persymmetric determinants \A\ with special elements 
can often be evaluated more easily by first forming DA, 
DAD' or LA, LAL where, taking n =*= 4 for illustration, 
we have 


1 ... 


' 1 . 

-1 1 


-A 1 

,1-2 1 

= 

A 2 -2A 1 

-^1 3-3 1 


-A 3 3A 2 -3 A 1 

* 

" * 

— — . 


the coefficients appearing in the rows being the binomial 
coefficients o£ successive order. Since A a x = a 2 ~~~ a x , 
A 2 a t = and so on and [Z> j and |i| = 1, we 

find the following equivalents for the determinant \A\, 


\DA \ = 


&1 cig cq 

Aa t Aa 2 Aa z Aa 4 
A z a x A z a 2 A z a z A % a 4 7 
A z a x A s tt 2 d z a z A z a 4 


\DAD'\ 


a 1 Aa x A 2 a l A z a l 
Aa t A 2 a l A 3 (z x A 4 a l 


(3) 


or the same withZl^ instead of 2P, where A^ is the operation 
of A-differencing defined by A^a x = a 2 ~Xa v We may 
express these results in concise form as 
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1. Evaluate 1 3 G 10 

3 6 10 15 

6 10 15 21 

10 15 21 28 

after first constructing a table of successive differences of 
1, 3, 6, 10, ..., 21, 28. 

2. Evaluate 1 n n {2 ) n (z) 

n n i2 ) ^( 3 ) ^( 4 ) 

n (2) n (Z) n U) n (5) 
n (3) n (4) n ( 5 ) n (6) 

whore n ir) = n(n— l)(n— 2) ... r+J)/r!, by repeated use 

of the identity (n-|-l) (f .)~w (r) = n (r _ 1) . 

♦ » 

Circulant Matrices and Determinants. A matrix 
of order n X n of the type 

®1 ^2 ^4 

q % #2 ^3 

a 3 0*4 % a 2 

* (Ly, d Oj 

illustrated by the case n == 4, is called a urMant miztrix, 
and its determinant is called a circulant. There are only n 
independent elements. They appear in the first row,* and 
the elements in successive rows are the successive cyclic 
permutations of these. By inspection G is persymmetric 
about the secondary diagonal. ^ 

If all later rows of G are added to the first ro# f each 
element in the resulting first row is equal to a x +a 2 +. . . +a n . 
This therefore is a factor of the circulant |0|. 

This is only one of n factors, the typical factor being 
& 1 +a 2 co i +a 3 o^+ ... +a n a> n r~ 1 > where a> 5 is an n th root 
of 1. To see that this is so, let us construct Q^GQ, where 
O is a diagonal matrix of the type exemplified by 

1 

co 
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the diagonal elements being the powers of a primitive nP* 
root of 1. The effect, as the reader will verify, is to 
produce a new cireulant in which a k lias been replaced by 
a k w*" 1 . At the same time |6 Y | is unaltered in value, since 
|Q|-*|<7| |Q| = \G\. Hence by adding as before the later 
rows to the first row it is seen that a l -{ a 2 oj -j- ... +a„m n ““ x 
is a factor of or | C\. Thus \C\ contains the con- 

tinued product of n such factors ; and since the leading 
term derived from the diagonal elements is a% the only 
remaining factor must be 1. The result in question is thus 
established. 


a 2 a z ) 

(a t + a 2 +a $ )(a t + o a a 3 )(rq + <i> a « a »f ou 3 ), 

where <o and <o 2 are the complex cube roots of 1. 

5. Write down the corresponding factors of the cireulant 
of order 4 and verify by actual construction of their product 
that it is equal to the cireulant. 

6. The sum and the product of two cireulant matrices, 
and the reciprocal of a cireulant matrix, arc cireulant. 


3. CX>2. I 

C*2 I 

4. ci< i ct 3 

a 3 a x a 2 

Ct>2 & 3 


Centrosymmetric Matrices and Determinants. 

A matrix which is symmetric about the centre of its array 
of elements is said to be centrosymmetric. Thus if 
K is centrosymmetric of order n x n we must have 

Or again, if we make use of the matrix J which has 
units in the secondary diagonal and zeros elsewhere, and 
which in premultiplication reverses the rows of a matrix 
and in postmultiplication reverses the* columns, we may 
express a centrosymmetric matrix as a partitioned matrix 


K = 


A B 
JBJ JAJ 

J 


or K — 


A a B 

b' a m ' m b'J 

JBJ Ja JAJ 


( 7 ) 



D1ALYTIC ELIMINATION 


ns 


according as K is of order 2m x 2m or (2m — l)x(2m— • 1), 
In the second ease a is a column vector, 1/ is a row vector, 
and a mm is the central element. 

In either case a centrosyrnmetric determinant can be 
resolved into two determinant factors, for in the first 
wo have 


7 J 


‘ A 

B ' 


7 -J] \a+bj 



JBJ 

JAJ _ 


l\ [ JBJ JAJ —JB 


as the reader should verify, interpreting also the operations 
as row and column operations upon K. Thus by Laplaeian 

expansion \K\ = \A+BJ\ \JAJ -JB\. . . (9) 

In the second case we have 



“ A 

b' 

JBJ 

~A+BJ 

b' 

JBJ 


Ja JAJ 

2a 


a c ■ i] 


Ja JAJ —JB 


which by Laplaeian expansion gives 


1*1 = 


A ~\-BJ 2a 

b' a mm 


\JAJ-JB\ 


, (10) 


( 11 ) 


the second factor being of the same form as in (9). The 
factors here are of orders m and m— 1. 


7. Construct centrosyrnmetric determinants of orders 2, 3, 
4 and 5 with literal elements and factorize them. 


52- Dialytie Elimination : Bigradient Matrix 

Consider two algebraic equations, such as 

a ( pc i +a 1 x+a 2 = 0, . . .(I) 

b^+bjzP+bp+bs = 0 , . . ( 2 ) 

in the general case of degrees m and n. Let it be asked 
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what condition the coefficients % and bj must satisfy, sa 
that the equations may have a root irf eonpnon. •Multi- 
plying (1) by 1 , x, x 2 and (2) by 1 , lb iff turn^wfc cStam 
five homogeneous equations in 1, % x 2 , condi- 

tion for their consistency is that tlyjir d^™^***^ - * - 
a 0 ct>i &2 . . 

. a 0 a 1 a 2 . v ■ 

. • a 0 a x a % ** 0. 

* ^0 ^1 ^2 ^3 

*0 ^1 ^2 ^3 * : ,, * 

Such a determinant, or such a matrix, is said to fee bi- 
gradient. (We have conformed tp custom in making the 
persymmetry in the a 5 and bj run in two different slopes.) 
In the general case the bigradient is of order m-\~n> with 
n rows of elements % sloping down persymmetriealfy, and 
m rows of elements bj sloping up. The process of eliminating 
x in this way from two pplynomials|s due to Sylvester, and 
is called dialytic elimination . 

1. Express by means of a bigradient the condition that 
the quadratic equation a^c 2j r a l x-{-a 2 = 0 should have a 
double root. Evaluate the bigradient. 

2. In the same way find the condition that the general 
cubic equation should have a repeated root. (The second 
polynomial in each of these examples will b© the derivative 
of the first.) 

53. Continuant Matrices and Continuants 

A matrix G n of order nxn which has zero elements 
everywhere except in the principal diagonal, the super- 
diagonal directly above it, and the subdiagonal directly 
below, is called a continuant matrix , and its determinant a 
continuant. For example 




(1) 
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are continuant matrices. In the second of these we have 
divided the subdiagoxjal. elements by suitable factors to 
make them ‘all -**1. This is the usual convention. 

The na m&continuant isdue to the intimate relation which 
the continuant matrix, in the above standard form, bears 
to the > t&ej>xy of cOntijmed fractions. Consider for example 
p sequehceof contmuedTractions, in the usual notation, 


j = 0 , 1 , 2 , 


n , 


iJ 

CqU- £i~t ^ 

|h# fi^% n of these being convergents to the final one. 
is not difficult to show that the numerators pj satisfy the 
set of recurrence relations 


(2) 

It 


= 1 ^i =r c&h Pi = CiP^+d^^ j> 1, (3) 
while the denominators satisfy the very similar relations 


9o = c o> ?i = <h<7o+ d i> •> = ¥m+^ 2 » i>l- (4) 

Now the continuants |C^| satisfy the same relations. 
For expanding \Cj\ according to its last row and column, 
and noting the initial values |6 f 0 | and |G\|, we have 

l^ol “ c o> \@i\ == c il^ol~Mi> \@j\ 5=5 (5) 


so that the continuants |C^| are equal to the denominators 
of convergents of the associated continued fraction. 

The numerators of the convergents are also continuants. 
For the cofaetor of the leading element c 0 in |C^|, let us say 
is a continuant. Expanding 6 it by a similar Cauchy 
expansion and noting initial values, we have 

lOl;o! ” c l» 

so that \Cj. 0 | = pj. Thus the numerator of any convergent 
is the cofactor ofrttie leading element in the continuant 
representing the denominator. 

Hence the continued fraction is equal to \G nt0 \/\G n \, 
which is the leading element in the reciprocal matrix 
C^" 1 . This fundamental result connects the theory of 
continuant matrices with the theory of continued fractions. 
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1. A continuant in standard form and with elements 
c ( = 1, considered as a polynomial, is prim© to its loading 
minor of order n, namely the cofactor of its last element c n . 

2. A continuant in standard form, with unit elements in 
the superdiagonal and integer elements in the diagonal, is 
prime to the cofactor of its first element. 


54. Jacobian, Hessian and Wronskian Matrices 

There is an important class of matrices the elements of 
which are functions and their derivatives. 

Consider n functions Uj of n independent variables x it 
the % possessing first partial derivatives within a certain 
region. Let us suppose first that the % obey a relation 

<f>{u v u 2 ..., u n ) ss <f> = 0, . . . (1) 

where <f> possesses first derivatives 8<j>!diij. 

Differentiating <j) with respect to the x { we obtain 

d _h ijtt ^ __ o (2) 

dXi 8u x dx { ‘ du 2 3x 4 * * * ‘ 8u n bx { ' 


These are n homogeneous equations in the <:<f>jdUj t 
and for their consistency the determinant \bu s jdx t \ must 
vanish. This determinant is called the Jacobian of the 
functions % with respect to the variables x {} and is often 
denoted by 

U I ’lit _ f 

( 3 ) 


S(Mj, u<i 


8[x v x 2 


*„) 


It -will be convenient to transpose the matrix of the 
system (2) and to regard [, dujdz ,], or briefly [dujdx], as 
the Jacobian matrix of the vector u with respect to the 
vector x. We may also consider m functions of n variables, 
with Jacobian matrix of order mxn. 


1. If y = Ax, then [by fix] ~ A. Thus [8x/8x] ~ I. 

2. If v = Au, then [dv/8, x] = A[du/ 8x]. 

3. If a: = Ay tho column vectors {BujcxJ and {8uj8ij ( } 
ftro related according to {8u!8y t } = A'{8uj8x { }. In fact 
\8uj8y] = [8u/8x] A. 
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4. Tho reader should verify the above results for cases 
of low on lor written in full, and for special functions. 
Jacob ians of chosen sets of functions should also be evaluated ; 


Change of Variables. Consider now m functions u i 
of n independent variables y 5 , these y 5 being functions of 
n independent x k . We have then [du/dy] and [dy/dx] ) 
of orders mxn and nxn. It is fundamental that 

[du/dy] [dy/dx] = [du/dx], . . . (4) 

In fact the ({, j) th element of the product on the left is 

dUi djh du t dy^ dUi dy^ fS) 

dlh dx 3 ^ dy 2 dy n dxf * 1 } 

which, by the differential calculus, is equal to dUi/dXj. 

It follows at once that [dy/dx] and [dxjdy] are reciprocal 
matrices, since by (4) their product is /. 

The reason for the notation (3) for a Jacobian now 
appears. For taking determinants of square Jacobian 
matrices in (5) we have \du/dy\ \dyjdx\ = \du/dx\ 9 which 
in the case n = 1 is the familiar formula for chango of 
variable in a derivative. In fact a Jacobian might be 
described as a multiple first differential coefficient of n 
functions with respect to n variables. 

This description is warranted by the fact (see for 
example Gillespie, Integration , § 16) that when a multiple 
differential dx 1 dx 2 ...dx n is transformed to new in- 
dependent variables y j the new differential element is 
\dx/dy \ dy 1 dy 2 ...dy n . The proof of this fundamental result 
is outside our scope, but the reader may consider the case 
of two variables, the transformation from rectangular 
coordinates x } y to other coordinates u, v, for example to 
polar coordinates r, 9. Through a point P in the (x, y) 
plane let curves -u = const., v = const, be drawn, and on 
them let neighbouring points Q and E be taken respectively. 
Let the corresponding curves through Q and R intersect in 

i 
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S. Then apart from higher infinitesimals, the coordinates 
of Q, R, S if P be taken as origin (0, 0), are 


( 8 *dv,^<h) (~du,^du) li 

\dv dv /, \cu du /, \c 


Z du \ C -~dv, ~- du * 
cu i v c u 


■ -%i,\ 

cv ). 


Then the area of the elementary triangle 
(19, Ex. 17) is found to be 

I dx 


i 


_ 

du du 
dx By 
dv dv 


dudv = \ dudv , 

1 &(u 7 V) 


0) 


and by taking S as origin we find the same value for 
SQR. Thus taking both triangles we have the element 
of area PQSR. 

Similar reasoning applies in three dimensions. The 
element of volume enclosed by neighbouring contours 
u = const., v = const., w = const, may be dissected into 
six equal elementary tetrahedra, the total volume being 
thus dudvdw multiplied by the Jacobian. 

Test of Functional Dependence. We have seen that 
functional dependence implies the vanishing of a Jacobian. 
The converse theorem is also true and is an important 
test of dependence, but the accurate proof of it exceeds our 
limits of space and in any case belongs more to function 
theory. 

5. Test by the Jacobian the functional dependence) of 

X x +Xi+£—X l X t ~X 1 X z --X& tt 
s , 3 . :t „ 


Hessian Matrix and Hessian. The Jacobian matrix, 
with respect to the x i} of the first derivatives dulbx i of a 
function u is called the Hessian matrix of u, its determinant 
the Hessian of %. The Hessian matrix is therefore 

chi 

ax fix K 


H = 


(8) 
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In a wide class of eases it is immaterial in which, order of 
different inf ion the second partial 5 derivatives are formed, 
and for such functions u the Hessian matrix is symmetric, 
//' H. 

If the variables x are transformed to new variables y 
by Ay t the cujdx £ are transformed (Ex. 3) according 
to {ru/fy^ — A'lcujcZi}, Hence, applying the second 
part of Ex, 3, we obtain the Jacobian matrix of the dujdy 5 
with respect to the y j7 in fact the Hessian matrix of u 
with respect to the y i5 as A' HA. The Hessian matrix thus 
undergoes congruent transformation, and the Hessian \I1\ 
is multiplied by \A\ % . This is usually expressed by saying 
that the Hessian is a covariant of u. 

6. Lot the X] be transformed to y j by a general trans- 

formation of Jacobian matrix M = [ctx/dy]. Show that the 
new //is where P is a certain matrix product. 

7. The Hessian matrix of a quadratic form x'Ax is 2A. 

S. Evaluate the Hessian of ax x -f Zbx x x % -f 3cx x xl + dx% and 
find also the Hessian of (ax x -\-bx 2 ) z . 

9- In the Taylor expansion of u(x l9 x 2 , x n ) about the 
origin {0, 0 ..., 0 } the term involving second partial derivatives 
of u is the quadratic form x'Hx/21. 

Wronskian Matrix and Wronskian. Before con- 
sidering the Wronskian we may note how to differentiate 
a determinant \A\ with respect to x, when each element 
is a function of x . In virtue of the rule 

^(/x/2/3) = + ~SJz + 

each term in the expansion of | A | gives rise, on being 
differentiated, to n terms, obtained by differentiating each 
factor a u in turn. Thus d\A | jdx contains n . nl terms. The 
aggregate of nl of these terms having differentiated elements 
belonging to the i th row forms a determinant obtained 
from \A j by writing da {j /dx for a iS in that row. Thus 
d\A\jdz is the surn of n determinants, obtained from \A\ 
by differentiating each row in turn. 



132 


SPECIAL DETERMINANTS 


10. Write out and verify this result lor determinants of 
the second and third orders. 

The Wronskian matrix of n functions of an 
independent variable x is defined by 

[d *" 1 U 3 ~ 
dx*~ l 


W « 


w 


iskian of n x . 


u x 

% 

u z 

du t 

du 2 

du z 

dx 

dx 

dx 

d~u x 

d*u 2 

d' z U 3 

“dx 2 

~dx 2 

lix * 


m 


(m 


The chief property of the Wronskian determinant or 
Wronskian is that it vanishes when the functions satisfy 
a relation of linear dependence with nonzero constant 
coefficients. For if we have 

a x u x + a 2 u 2 +a z u B = 0 

we have also two further equations of the* same form in 
the first and second derivatives oiu u u z . The condition 
of consistency of these equations in a u « 2 , a z is \W | ~~ O. 

The converse theorem, that if the Wronskian is zero 
the functions obey a non-trivial linear relation, in true 
under certain restrictions. 

11. The derivative of a Wronskian is obtained from tin: 
Wronskian by differentiating its hist row. 

12. Prove that the Wronskian of h j\ .*'% x 1 *- 1 is a 

confluent alternant multiplied by a cerium constant factor. 

13. Evaluate the Wronskians of I, .r 2 , ./* and of./’ 3 , ,r*\ .rb 

Construct other se ts of simpl e functions ami evaluate thrir 
Wronskians. “ 
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